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3  Determination  of  Military  Requirements,  Scientific 

Progress,  and  Accomplishments 


The  Focused  Research  Initiative  (FRI)  group  from  Purdue  University,  the  University  of 
Illinois,  the  University  of  Michigan,  Raytheon  Systems  Company  (formerly  Hughes  Defense 
Communications  and  formerly  Magnavox),  and  Hughes  Network  Systems  (HNS)  began  work 
in  July  of  1995.  Technical  efforts  are  concentrated  on  multimedia  wireless  communication 
links,  routing  and  congestion  control,  the  interface  to  and  use  of  high  speed  networks,  adap¬ 
tive  coded  modulation  for  spread-spectrum  communication  systems,  and  adaptive  antenna 
arrays.  Our  industrial  partners  have  completed  a  requirements  analysis  stage  with  the  pur¬ 
pose  of  guiding  our  work  and  making  it  relevant. 

Communications  requirements  and  goals  have  been  obtained  through  interview  and  dis¬ 
cussion  with  various  Army  organizations.  These  requirements  and  goals  are  those  that  were 
specifically  identified  as  important  to  the  organization  and  its  objectives.  We  summarize  in¬ 
formation  obtained  in  an  updated  requirements  matrix  that  is  shown  in  vol.  2  of  this  report. 
The  requirements  matrix  is  divided  into  six  segments  depending  on  the  specific  application: 
the  Task  Force  XXI  Brigade  (force  level  communication  exercise),  Untethered  Node  Op¬ 
eration,  Future  Digital  Radio,  High  Capacity  Trunk  Radio,  Information  Management,  and 
General  Communication  Requirements.  The  requirement  or  goal  is  identified  in  a  statement, 
and  a  key  is  provided  to  identify  the  source  of  the  requirement  or  goal.  Universities  with 
projects  that  address  these  requirements  and  goals  have  now  been  identified  in  our  updated 
requirements  matrix. 

Some  details  follow  which  describe  our  interactions  among  the  universities  and  industry. 
Brief  descriptions  of  accomplishments  are  given  with  details  and  graphics  provided  in  the 
appended  selection  of  research  papers  and  reports. 
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3.1  Wireless  Multimedia  Networks,  Routing  and  Congestion  Con¬ 
trol,  Interface  of  Wireless  Networks  to  High-Speed  Wide-Area 
Wireline  Networks  (Bruce  Hajek,  Univ.  of  IL;  U.  Madhow, 
Univ.  of  IL;  E.  Chong,  Purdue  Univ.) 

3.1.1  Capture  Probability  for  a  Large  Number  of  Stations 

The  probability  of  capture  under  a  model  based  on  the  ratio  of  largest  received  power  to 
sum  of  interference  powers  is  examined  in  [20]  in  the  limit  of  a  large  number  of  transmitting 
mobiles.  Specifically,  if  n  transmitters  are  present,  the  transmission  of  a  given  mobile  j  is 
captured  if  the  power  received  from  that  mobile  exceeds  a  threshold  z  times  the  sum  of 
the  powers  received  from  the  other  mobiles  and  the  thermal  noise.  The  parameter  z  is  the 
minimum  carrier-to-interference  ratio  needed  for  successful  reception,  and  is  determined  by 
such  factors  as  the  type  of  modulation  and  the  receiver  sensitivity.  In  a  land  mobile  radio 
system,  the  received  power  from  a  remote  mobile  at  distance  r  can  be  reasonably  modeled 
as  the  product  of  the  transmitted  power,  a  Rician  or  Rayleigh  distributed  random  variable 
to  account  for  multipath  fading,  an  exponential  of  a  Gaussian  random  variable  to  account 
for  shadowing,  and  a  power-law  term  to  account  for  the  near-far  effect. 

The  paper  [20]  makes  two  points:  (1)  In  the  limit  of  a  large  number  of  transmitters,  the 
probability  of  capture  is  a  simple  function  depending  only  on  z  and  the  roll-off  parameter 
of  the  distribution  of  power  received  from  a  typical  mobile,  and  (2)  under  broad  conditions 
the  roll-off  parameter  is  determined  by  the  near-far  effect  (through  the  power  roll-off  with 
distance  and  the  spatial  distribution  of  mobiles),  and  is  insensitive  to  many  typical  channel 
effects  such  as  Rayleigh  or  Rician  fading  and  log-normal  shadowing. 


3.1.2  Packet  Scheduling  with  Deadlines  and  Priorities 

Causal  scheduling-dropping  policies  for  multiclass  traffic  with  deadlines  have  been  developed. 
Packets  with  deadlines  and  class  labels  arrive  in  discrete  time  and  are  either  scheduled  by 
their  deadlines,  or  dropped  on,  or  possibly  before,  their  deadlines.  This  work  is  motivated  by 
the  problem  of  scheduling  packet  transmissions  in  a  radio  network.  Typically,  only  limited 
bandwidth  is  available.  It  is  therefore  crucial  that  the  nodes  carefully  determine  the  order  in 
which  they  transmit  packets  over  their  outgoing  links.  There  is  a  tradeoff  between  respecting 
the  priority  class  of  a  packet,  and  excessively  delaying  lower  priority  traffic.  Our  work  has 
addressed  this  tradeoff  in  a  principled  way. 

Initially,  our  research  focused  on  the  case  in  which  there  are  two  classes  of  traffic,  and 
on  causal  policies  which  at  any  given  time  base  decisions  on  arrivals  up  to  that  time.  These 
results  were  written  up  in  [38]  and  presented  in  [39].  First,  the  causal  scheduling-dropping 
policies  that  maximize  total  throughput  are  identified.  Then,  those  scheduling-dropping 
policies  with  the  greatest  throughput  for  the  high  priority  class,  subject  to  being  causal  and 
maximizing  total  throughput,  are  identified. 
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In  late  1997  and  early  1998  our  attention  turned  to  improving  our  results  in  two  ways:  (a) 
handling  more  than  two  priority  classes,  and  (b)  devising  computationally  efficient  algorithms 
for  scheduling  and  dropping.  The  results  of  this  phase  of  research  are  summarized  in  [?]. 
Prom  the  summer  of  1998  until  the  present,  we  have  continued  to  work  on  this  problem.  We 
have  discovered  a  more  general  and  elegant  method  for  scheduling  multiple  priority  classes  in 
an  efficient  way.  While  the  algorithm  is  simple,  the  proof  of  its  correctness  is  rather  complex. 
We  aim  to  have  this  work  complete  by  June  1999.  One  or  two  journal  papers,  in  addition  to 
at  least  one  conference  publication,  will  be  prepared  on  the  basis  of  this  material. 

3.1.3  Capacity  of  Fading  Channels 

This  paper  [35]  summaries  some  of  our  recent  work  on  the  application  of  the  concepts  of 
capacity  per  unit  cost  and  reliability  function  per  unit  cost  to  models  of  fading  channels.  The 
cost  is  taken  to  be  the  energy,  or  a  certain  fourth  moment  functional  that  is  sensitive  to  the 
burstiness  of  the  signals,  and  is  related  to  the  ambiguity  function  of  the  input  signal.  The 
channel  is  assumed  to  be  a  Wide-Sense  Stationary  and  Uncorrelated  Scattering  (WSSUS) 
channel.  More  of  this  work  will  be  available  in  [36],  and  in  a  planned  journal  paper.  The 
majority  of  Vijay  Subramanian’s  Ph.  D.  thesis  support  derived  from  the  the  FRI  contract. 
Vijay  plans  to  complete  his  Ph.  D.  in  the  summer  of  1999. 

3.1.4  A  problem  zirising  in  code  synchronization  in  FH-CDMA 

We  studied  a  random  multiple-access  system,  where  T  active  stations  among  M  stations 
contend  over  a  common  medium  in  L  minislots.  Each  active  station  transmits  in  a  subset  of 
these  minislots,  the  subsets  chosen  by  the  stations  being  mutually  independent.  A  success 
is  said  to  occur  if  during  at  least  one  minislot  there  is  a  single  transmission;  otherwise,  a 
failure  occurs.  We  devised  policies  that  attempted  to  minimize  the  probability  of  failure. 
We  assumed  that  the  number  of  active  users  T  is  not  known  a  priori,  so  that  the  worst  case 
failure  probability,  obtained  by  maximizing  over  T  up  to  some  assumed  upper  bound  on  T. 
Comparisons  of  the  probability  of  failure  that  we  achieved  against  lower  bounds  that  we 
derived,  show  that  our  strategies  nearly  minimizes  the  maximum-over-T  failure  probability. 
The  application  we  have  in  mind  is  that  there  are  L  minislots  at  the  front  of  each  packet 
transmission  slot.  A  station  identifies  itself  and  the  hopping  pattern  that  is  will  use  in  the 
rest  of  the  slot  that  follows. 

3.1.5  Wireless  support  of  TCP/IP 

We  have  shown  how  to  support  TCP/IP,  the  Internet  transport  protocol,  over  wireless  links, 
and  what  performance  to  expect.  Our  work  provides  the  first  analytical  approach  to  design 
and  performance  analysis.  In  particular,  we  demonstrate  analytically  [19]  that  TCP-aware 
link  layer  error  recovery  is  an  excellent  approach  to  wireless-wireline  interface  design  for  the 
support  of  TCP.  This  work  builds  on  a  basic  understanding  of  TCP  developed  in  [22]. 
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Connections  over  heterogeneous  networks  encounter  large  bandwidth-delay  products  and 
random  loss  (i.e.,  loss  that  cannot  be  attributed  to  congestion).  Since  TCP/IP  cuts  back 
its  sending  rate  for  any  loss  (since  it  interprets  all  losses  as  congestion),  its  performance 
can  deteriorate  seriously  if  random  losses  occur  frequently.  In  particular,  if  the  end-to-end 
packet  loss  probability  seen  by  TCP  is  q,  and  the  bandwidth-delay  product  is  W,  then  we 
have  shown  that  TCP  gives  poor  throughput  if  qW^  is  large  [22],  Thus,  systems  should  be 
engineered  so  as  to  keep  the  end-to-end  loss  probability  seen  by  TCP  below  l/W^.  In  [19], 
we  put  this  rule  of  thumb  to  work  by  considering  TCP  performance  over  a  Rayleigh  faded 
link.  We  use  a  link  layer  retransmission  scheme  which  delivers  packets  in  order  to  TCP  and 
thus  hides  wireless  losses  from  TCP,  so  that  the  only  remaining  losses  seen  by  TCP  are  at  the 
buffer  at  the  wireless-wireline  interface.  We  show  that  the  buffer  size  required  to  attain  the 
required  end-to-end  loss  probability  of  l/W^  is  moderate.  A  close  match  between  analysis 
and  simulations  lend  further  weight  to  our  design  approach.  Finally,  we  give  an  asymptotic 
argument  that  shows  that  the  interface  buffer  size  required  to  make  this  approach  work  scales 
only  logarithmically  with  the  bandwidth-delay  product  for  very  general  classes  of  wireless 
channels. 

We  have  made  substantial  progress  on  the  issue  of  providing  service  guarantees  over  wire¬ 
less  links.  We  have  considered  a  wireless  downlink  in  which  a  single  transmitter  is  sending 
real-time  data  to  different  receivers  over  different  Rayleigh  fading  channels.  Error  control 
coding  with  interleaving  is  used  to  deal  with  the  fading.  The  approach  consists  of  providing 
a  deterministic  delay  guarantee  and  a  statistical  loss  guarantee.  Since  loss  can  occur  due  to 
either  buffer  overflow  or  due  to  loss  on  the  wireless  link,  it  is  essential  to  partition  the  allow¬ 
able  delay  in  an  optimum  way  between  buffering  and  interleaving.  We  provide  a  framework 
for  doing  this  systematically,  while  providing  mechanisms  for  statistical  traffic  multiplex¬ 
ing.  Results  have  appeared  at  the  International  Symposium  on  Information  Theory  in  1998 
(ISIT’98). 

Despite  the  termination  of  the  FRI  program,  we  are  continuing  this  line  of  research 
with  DoD  support  from  an  ARO  contract  for  research  in  “Communications  in  the  Digital 
Battlefield:  Fundamental  Problems  in  the  Design  of  Heterogeneous  Networks.” 

3.1.6  Dynamic  congestion  control  using  SACKs 

Since  adequate  link  layer  error  recovery  may  not  always  be  available,  it  is  of  interest  to 
design  transport  protocols  that  can  operate  in  unfriendly  environments.  We  proposed  in 
1996  a  transport  protocol  [26]  that  modifies  the  congestion  control  mechanism  in  TCP  to 
take  advantage  of  selective  acknowledgments  (SACKs).  The  performance  of  this  protocol  in 
a  heterogeneous  environment  is  far  superior  to  that  of  TCP.  It  is  worth  noting  that,  while 
SACKs  have  been  proposed  in  the  Internet  Engineering  Task  Force  (IETF)  and  have  been 
previously  shown  to  produce  some  improvement  in  the  performance  of  TCP  with  its  current 
congestion  control  mechanism,  the  fundamental  performance  problems  of  TCP  in  a  wireless 
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environment  cannot  be  solved  without  modification  of  the  congestion  control  mechanism. 

The  poor  performance  of  TCP  in  the  presence  of  random  loss  is  because  the  congestion 
control  algorithm  in  TCP  does  not  distinguish  random  loss  from  sustained  congestion.  This 
in  turn  is  partly  because  TCP  uses  cumulative  acknowledgments,  and  therefore  cannot  ac¬ 
curately  identify  how  many,  and  which,  packets  are  lost  and  should  be  retransmitted.  In 
our  work,  we  have  examined  the  performance  gains  that  could  result  from  the  additional 
information  that  could  be  obtained  using  selective  acknowledgments.  We  have  considered 
one  specific  form  of  selective  acknowledgment,  and  have  considered  one  specific  dynamic 
window-based  scheme  (with  dynamics  similar  to  that  of  the  TCP  congestion  window)  de¬ 
signed  to  exploit  it.  The  performance  gains  over  current  forms  of  TCP  are  substantial,  and 
motivate  further  investigation  into  how  best  to  include  selective  acknowledgments  in  TCP. 
We  have  also  considered  a  fixed  window  scheme  with  selective  acknowledgments  in  order 
to  develop  analytical  guidelines  for  estimating  the  sequence  number  space  and  the  destina¬ 
tion  buffer  (as  a  function  of  the  bandwidth-delay  product)  required  to  support  end-to-end 
selective  acknowledgments.  We  have  found  that  these  grow  as  WlogW,  where  W  is  the 
bandwidth-delay' product,  and  as  where  q  is  the  probability  of  random  loss. 

One  of  our  key  achievements  is  the  development  of  a  novel  framework  for  reliable  data 
transport  over  heterogeneous  networks.  End-to-end  data  transport  protocols  have  two  main 
functions:  error  recovery  and  congestion  control.  The  information  required  by  the  sender  to 
perform  these  functions  is  provided  by  acknowledgements  (ACKs)  from  the  receiver.  The 
Internet  transport  protocol,  TCP/IP,  uses  cumulative  acknowledgements  (CACKs),  which 
provide  a  robust  but  minimal  mechanism  for  error  recovery  which  is  inadequate  for  hetero¬ 
geneous  networks  with  random  loss.  Furthermore,  TCP’s  congestion  control  mechanism  is 
based  on  counting  ACKs,  and  is  therefore  vulnerable  to  loss  of  ACKs  on  the  reverse  path, 
particularly  when  the  latter  may  be  slower  than  the  forward  path,  as  in  asymmetric  net¬ 
works.  Our  contributions  are  as  follows: 

(a)  We  show  that  a  simple  enhancement  of  CACK  provides  sufficient  information  for  end- 
to-end  congestion  control.  We  term  this  ACK  format  total  ACKs  (TACKs).  Modification  of 
TCP-Reno  to  exploit  TACKs  leads  to  significant  performance  improvements  for  asymmetric 
networks,  eliminating  timeouts  during  Reno’s  fast  recovery  phase.  TACKs  do  not  improve 
the  error  recovery  capability  of  TCP,  however,  so  that  the  modified  TCP-Reno  is  still  vul¬ 
nerable  to  random  loss. 

(b)  We  devise  a  novel  ACK  format  that  uses  TACKs  for  congestion  control,  and  negative 
ACKs  (NACKs)  for  efficient  error  recovery.  Typically,  the  main  concern  with  NACKs  is 
that  of  robustness  to  ACK  loss,  and  we  address  this  using  an  implementation  that  provides 
enough  redundancy  to  provide  such  robustness. 

(c)  We  use  the  TACK-f-NACK  acknowledgement  format  as  the  basis  for  a  new  transport 
protocol  that  provides  efficient  error  recovery  and  dynamic  congestion  control.  The  proto¬ 
col  provides  large  performance  gains  over  TCP  in  an  environment  with  random  loss,  and 
is  robust  against  loss  of  ACKs  in  the  reverse  path.  In  particular,  the  protocol  gives  high 
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throughput  upto  a  designed  level  of  random  loss,  independent  of  the  bandwidth-delay  prod¬ 
uct.  This  is  in  contrast  to  TCP,  whose  throughput  deteriorates  drastically  if  the  random  loss 
probability  is  higher  than  the  inverse  square  of  the  bandwidth-delay  product.  Efforts  have 
been  focused  on  producing  a  Request  For  Comment  (RFC)  in  order  to  disseminate  our  ideas 
to  the  Internet  Engineering  Task  Force  (IETF).  The  work  described  above  was  the  subject  of 
an  invited  presentation  at  INFORMS’98  and  has  been  presented  at  ACM  SIGMETRICS’98. 


3.1.7  Providing  Quality  of  Service  Guarantees 

Quality  of  Service  (QoS)  guarantees  to  real  time  traffic  has  been  considered  through  the  ex¬ 
ample  of  a  Rayleigh  faded  wireless  downlink.  Methods  for  statistical  multiplexing  have  been 
considered,  and  it  was  shown  that  QoS  provisioning  in  this  context  requires  consideration 
of  traffic  statistics,  choice  of  link  layer  error  recovery  scheme,  as  well  as  wireless  channel 
statistics.  To  the  best  of  our  knowledge,  this  is  the  first  work  that  provides  detailed  consid¬ 
eration  of  all  these  factors,  and  shows  how  important  joint  optimization  across  all  the  system 
parameters  is  for  efficient  design.  The  work  has  appeared  at  the  International  Symposium 
on  Information  Theory  (ISIT’98)  and  the  International  Conference  on  Universal  Portable 
Communications  (ICUPC’98),  and  has  been  submitted  for  journal  publication. 

3.1.8  Location  Tracking  and  Position  Reporting 

Diminishing  cell  sizes  and/or  the  use  of  increasingly  directed  antennas,  coupled  with  bursty 
or  intermittent  transmission,  complicate  wireless  acquisition  and  synchronization.  It  may  be 
desirable  that  explicit  position  update  messages  be  issued.  There  is  a  basic  tradeoff,  since 
increasing  the  rate  of  updates  can  increase  the  speed  of  paging  or  acquisition,  but  it  also 
increases  channel  congestion  and  network  load.  There  is  also  an  interaction  between  the 
policy  used  for  issuing  updates,  and  the  policy  used  for  paging.  The  basic  tradeoff,  and  the 
interaction  between  location  updates  and  paging,  are  investigated  in  [23]. 

Motion  is  modeled  as  a  discrete-time  Markov  chain,  and  an  iterative  optimization  ap¬ 
proach  is  presented  that  seeks  to  minimize  the  total  cost  of  registration  and  paging.  Given  a 
particular  paging  policy,  the  optimal  registration  policy  is  found  through  an  infinite-horizon, 
discounted  cost  dynamic  programming  framework.  Given  a  particular  registration  policy, 
the  optimal  paging  policy  is  a  maximum  likelihood  ordering  of  the  cell  locations.  Iterations 
on  these  conditionally  optimal  policies  are  performed  until  convergence  is  achieved. 

Various  motion  models  are  considered,  including  motion  in  one-dimensional  ring  and 
two-dimensional  rectangular  and  hexagonal  layouts.  In  general,  regions  in  which  the  mobile 
station  continues  without  attempting  registration  tend  to  follow  the  expected  path  of  the  mo¬ 
bile  station.  The  continuation  regions  tend  to  get  larger  as  the  successful  registration  penalty 
increases  and  as  the  registration  failure  probability  decreases.  Additionally,  it  is  shown  that 
the  iterative  optimization  algorithm  does  not  necessarily  produce  globally  optimal  pairs  of 
policies.  The  algorithm  can  produce  non-optimal  fixed-point  solutions  depending  on  initial 
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conditions  and  motion  parameters. 


3.1.9  Channel  Carrying:  A  Novel  HandofF  Scheme  for  Mobile  Cellular  Networks 

We  have  examined  a  new  scheme  that  addresses  the  call  handofF  problem  in  mobile  cellular 
networks.  Efficiently  solving  the  handofF  problem  is  important  for  guaranteeing  Quality  of 
Service  (QoS)  to  already  admitted  calls  in  the  network.  Our  scheme  is  based  on  a  new 
approach  called  channel  carrying',  when  a  mobile  user  moves  from  one  cell  to  another,  under 
certain  mobility  conditions,  the  user  is  allowed  to  carry  its  current  channel  into  the  new 
cell.  We  have  proposed  a  new  channel  assignment  scheme  to  ensure  that  this  movement  of 
channels  will  not  lead  to  any  extra  co-channel  interference  or  channel  locking.  In  our  scheme, 
the  mobility  of  channels  relies  entirely  on  localized  information,  and  no  global  coordination 
is  required.  Therefore,  the  scheme  is  simple  and  easy  to  implement.  We  have  developed  a 
hybrid  channel  carrying  scheme  that  allows  the  maximization  of  performance  under  various 
constraints.  We  have  found  numerical  results  comparing  our  scheme  with  the  traditional 
channel  reservation  and  borrowing  techniques.  We  have  found  that  our  scheme  outperforms 
the  reservation  scheme  over  a  broad  range  of  traffic  parameters  and  also  outperforms  the 
channel  borrowing  scheme  under  high  loads.  The  results  of  this  research  have  been  reported 
in  [6]  and  [7]. 

3.1.10  New  Localized  Channel  Sharing  Scheme  for  Cellular  Networks 

Enhancing  system  capacity  while  maintaining  quality  of  service  is  an  important  issue  in 
wireless  cellular  networks.  A  new  localized  channel  sharing  scheme  has  been  found  to  address 
this  problem.  Our  basic  idea  is  to  allow  channels  to  be  shared  between  adjacent  cells.  We 
have  proposed  a  fixed  channel  assignment  scheme  to  maximize  channel  reuse  efficiency  while 
allowing  channel  sharing.  We  have  shown  that  our  sharing  scheme  can  also  facilitate  handofF 
processing.  An  important  feature  of  our  sharing  scheme  is  that  channel  management  is 
localized  between  adjacent  cells,  and  no  global  coordination  or  optimization  is  required,  thus 
making  it  suitable  for  implementation. 

We  have  obtained  simulation  results  comparing  our  scheme  with  the  conventional  channel 
assignment  and  handofF  techniques.  We  found  that  our  scheme  improves  system  capacity 
over  a  broad  range  of  traffic  parameters  and  a  variety  of  quality  of  service  requirements.  The 
results  of  this  research  have  been  reported  in  [8],  [9],  and  [10]. 

3.1.11  An  EfiScient  Scheme  to  Reduce  Handoff  Dropping  in  LEO  Satellite  Sys¬ 
tems 

The  problem  of  handoffs  in  cellular  networks  is  compounded  in  a  low-earth  orbit  (LEO) 
satellite-based  cellular  network  due  to  the  relative  motion  of  the  satellites  themselves  with 
respect  to  a  stationary  observer  on  earth.  Typically,  the  velocity  of  motion  of  mobiles  can 
be  ignored  when  compared  to  the  very  high  velocity  of  the  footprints  of  satellites.  We  have 
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exploited  this  property  of  the  LEO  satellite  systems  and  proposed  a  handoff  scheme  based 
on  [10]  that  results  in  a  significant  decrease  in  handoff  dropping.  For  the  same  handoff 
dropping  probability,  our  scheme  has  significantly  lower  new  call  blocking  probability  than 
the  conventional  reservation  scheme.  We  have  presented  an  analytical  approximation  that 
is  in  very  good  accord  with  simulation  results.  The  results  of  this  effort  have  been  reported 
in  [11], 

3.1.12  Channel  Sharing  Scheme  for  Packet-Switched  Cellular  Networks 

We  have  examined  an  approach  for  sharing  channels  to  improve  network  utilization  in  packet- 
switched  cellular  networks.  The  scheme  exploits  unused  resources  in  neighboring  cells  with¬ 
out  the  need  for  global  coordination.  A  minimax  approach  to  optimizing  the  allocation  of 
channels  in  this  sharing  scheme  has  been  formulated,  a  distributed  algorithm  to  achieve  this 
objective  has  been  developed,  and  the  convergence  has  been  studied.  Simulation  results  have 
shown  that  the  distributed  channel  sharing  scheme  performs  better  than  the  fixed  channel 
scheme  over  a  wide  variety  of  traffic  conditions.  The  results  of  this  effort  have  been  reported 
in  [12].  •  ' 

3.1.13  CDMA  Systems  in  Fading  Channels:  Admissibility,  Network  Capacity, 
and  Power  Control 

Due  to  the  fast-growing  demand  for  network  capacity  in  wireless  networks,  the  characteri¬ 
zation  of  network  capacity  has  become  a  fundamental  and  pressing  issue.  While  there  have 
been  considerable  efforts  on  CDMA  systems  at  both  the  physical  layer  and  network  layer,  the 
network  capacity  of  imperfect  power-controlled  CDMA  systems  with  linear  receivers  in  fad¬ 
ing  environments  are  less  well-understood.  Hence,  we  have  studied  a  single-cell  synchronous 
CDMA  system  with  matched  filter  receivers,  assuming  known  distributions  of  received  pow¬ 
ers.  We  have  found  necessary  and  sufficient  conditions  on  the  admissibility  for  both  single 
class  and  multiple  class  systems  with  random  signatures.  For  systems  with  deterministic 
signatures,  we  have  found  a  necessary  and  sufficient  condition  on  the  admissibility  for  single 
class  systems,  but  only  a  sufficient  condition  for  multiple  class  systems.  We  have  also  identi¬ 
fied  the  network  capacity  of  single  class  systems  for  both  the  random  signature  case  and  the 
deterministic  signature  case.  The  network  capacity  can  be  uniquely  expressed  in  terms  of 
the  users’  QoS  requirements  and  the  distributions  of  the  received  powers.  We  have  found  the 
tightest  upper  bound  on  the  network  capacity  over  all  possible  distributions  of  received  pow¬ 
ers,  and  have  explored  the  concepts  of  effective  target  SIR  and  effective  bandwidth,  which 
play  an  important  role  in  determining  the  admissibility  and,  hence,  the  network  capacity. 
The  results  of  this  effort  have  been  reported  in  [14],  [15],  and  [16]. 

3.1.14  A  Static  Power  Control  Scheme  for  Wireless  Cellular  Networks 

We  have  found  a  novel  static  power  control  scheme  to  improve  system  capacity  in  wireless 
cellular  networks.  Our  basic  idea  is  to  reduce  intercellular  interference  and  improve  the 
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capture  probability  by  an  a  priori  assignment  of  power-levels  in  different  cells.  We  have  for¬ 
mulated  and  solved  an  optimal  scheduling  problem  for  our  scheme.  We  have  determined  that 
the  optimal  scheduling  policy  is  a  simple  form  of  bang-bang  control.  Our  scheduling  solution 
has  been  illustrated  by  considering  two  specific  cases  with  uniform  and  nonuniform  fairness 
constraints.  We  have  evaluated,  via  numerical  analysis  and  simulation,  both  throughput  and 
delay  of  the  new  scheme,  and  have  compared  these  performance  metrics  with  other  schemes. 
Our  scheme  can  achieve  significant  performance  improvements,  in  terms  of  both  maximum 
throughput  and  throughput-delay  tradeoff,  over  a  wide  range  of  capture  ratio  values.  The 
results  of  this  effort  have  been  reported  in  [13]  and  [17]. 

3.1.15  Analysis  of  a  Distributed  Power  Control  Algorithm 

A  wireless  communication  system  consisting  of  mobile  users  communicating  to  base  stations 
over  a  common  radio  channel  has  been  analyzed  for  a  low  complexity  uplink  power  control 
algorithm.  At  each  iteration,  each  user  needs  only  a  single  bit  of  information  from  its  assigned 
base  station  indicating  whether  its  SIR  is  above  or  below  its  desired  level.  If  it  is  above, 
the  user  decreases  its  power  level  by  a  fixed  amount  (in  dB);  otherwise,  the  power  level 
is  increased  by  a  fixed  amount.  This  type  of  algorithm  has  been  shown  to  be  effective  in 
simulation  studies  of  various  types  of  systems.  Because  of  the  fixed  step  sizes,  the  algorithm 
does  not  actually  converge  to  the  optimal  power  assignment,  but  we  have  proven  its  stability 
and  some  performance  guarantees  in  terms  of  analytically  derived  bounds  on  the  convergence 
error.  We  have  shown  that  the  convergence  error  can  be  made  as  small  as  desirable,  at  the 
cost  of  reducing  the  convergence  rate.  We  have  not  considered  mobility,  but  rather  assumed 
that  the  network  parameters  are  held  constant  while  the  algorithm  runs.  The  results  of  this 
effort  have  been  reported  in  [18]. 
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3.2  Adaptive  Coded  Modulation  for  Spread-Spectrum  Commu¬ 
nication  Systems  (Wayne  Stark,  Univ.  of  MI;  D.  Sarwate, 
Univ.  of  IL,  U.  Madhow,  Univ.  of  IL;  Jim  Lehnert,  Purdue 
Univ.) 

3.2.1  Cancelling  Interference  and  Combining  Multipath  with  a  Linear  Receiver 
and  Long  Sequences 

We  have  examined  blind  adaptive  signal  processing  algorithms  that  work  by  taking  several 
samples  of  the  correlator  output  resulting  from  the  spread-spectrum  signal  during  the  time 
interval  when  multipath  is  arriving  at  the  radio’s  antenna,  and  comparing  them  with  samples 
taken  later.  Through  a  process  of  projecting  in  the  direction  of  the  signal  plus  noise  and  then 
projecting  onto  the  space  orthogonal  to  the  noise,  it  is  possible  to  determine  the  coefficients 
for  a  tapped  delay  line  that  follows  the  matched  filter  performing  the  correlation.  Multipath 
can  be  combined  and  interference  can  be  rejected  simultaneously,  and  if  an  antenna  array  is 
available,  the  same  algorithm  can  accommodate  the  antenna  array  and  steer  a  beam  toward 
the  correct  transmitter  while  producing  nulls  in  the  direction  of  interferers.  An  important 
aspect  of  our  work  is  that  it  applies  to  the  case  of  long  sequences,  which  are  used  in  the 
current  spread-spectrum  cellular  standard  and  must  be  used  in  military  systems  for  security. 
Many  had  thought  that  projection  techniques  required  short  sequences  and  hence  had  limited 
application.  The  net  result  of  this  research  is  to  increase  the  resistance  to  interference  and 
to  again  achieve  a  better  SNR  in  the  link  budget  of  the  system.  This  work  is  described  in 
[42].  The  role  of  the  chip  waveform  selection  for  improving  the  performance  of  such  systems 
is  described  in  [43]. 

3.2.2  Cancelling  Interference  and  Fading  Channel  Estimation  for  Multicarrier 
Spread  Spectrum 

The  near-far  problem  has  been  a  classic  difficulty  for  DS/SSMA  systems.  The  problem  is 
that  an  interfering  transmitter  that  is  close  to  a  receiver  can  overpower  faint  signals.  Even 
though  the  spread-spectrum  modulation  provides  significant  processing  gain  that  helps  to 
extract  faint  signals,  practically  the  gain  is  often  not  enough.  We  have  devised  for  use  in 
multi-carrier  spread-spectrum  systems  a  demodulation  scheme  that  excises  interference  and 
adjusts  for  fading.  This  scheme  is  shown  to  provide  great  gains  in  dealing  with  the  near-far 
problem.  This  result  in  described  in  [41]. 

3.2.3  Meeting  the  Welch  Bound  with  Equality 

A  signal  set  whose  root-mean-square  inner  product  magnitude  equals  the  Welch  lower  bound 
id  called  a  WBE  signal  set.  WBE  signal  sets  are  of  interest  in  synchronous  CDMA  com¬ 
munication  systems.  We  have  studied  the  signal-to-noise  ratio  for  CDMA  communication 
systems  using  WBE  signal  sets.  The  results  are  reported  in  [44]. 
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3.2.4  Design  of  Frequency  Hopping  Patterns 

Careful  design  of  the  frequency-hopping  patterns  used  in  a  frequency-hopped  communica¬ 
tion  system  can  lead  to  gains  in  performance.  Well-known  methods  of  frequency-hopping 
pattern  construction  lead  to  patterns  whose  average  Hamming  correlation  properties  are 
optimal,  but  which  have  some  local  Hamming  correlation  properties  that  are  less  than  desir¬ 
able.  For  example,  the  Lempel-Greenberger  hopping  patterns  have  the  undesirable  property 
that  collisions  between  two  different  patterns  can  occur  in  bursts.  Such  bursts  of  colli¬ 
sions  lead  to  bursts  of  errors  and  to  difficulties  in  the  design  of  the  error-control  coding 
systems  for  frequency-hopped  communication  systems.  Shorter  classes  of  hopping  patterns 
based  on  Reed-Solomon  codes  also  have  optimal  average  Hamming  correlation  but  have  the 
undesirable  property  that  most  of  the  patterns  hop  to  only  one-half  of  the  available  fre¬ 
quency  slots.  We  have  discovered  a  new  class  of  frequency-hop  patterns  whose  periods  are 
intermediate  between  the  short  Reed-Solomon  code  patterns  and  the  much  longer  Lempel- 
Greenberger  patterns.  The  average  Hamming  correlation  properties  of  these  patterns  are 
very  slightly  inferior  to  the  Reed-Solomon  patterns  and  the  Lempel-Greenberger  patterns, 
but  the  new  patterns  exhibit  neither  the  bursts  of  collisions  that  are  characteristic  of  the 
Lempel-Greenberger  patterns  nor  the  poor  spectral  spreading  properties  characteristic  of 
Reed-Solomon  patterns.  Thus,  these  patterns  can  be  used  to  improve  the  performance  of 
frequency-hop  communication  systems. 

The  new  frequency-hop  patterns  also  have  the  very  interesting  property  that  the  collisions 
occur  in  a  very  ‘^systematic”  fashion.  That  is,  if  two  patterns  collide  during  the  i-th  and 
jth  dwells  (say)  of  the  first  n  dwells,  then  they  will  also  collide  during  the  (n-l-i)-th  and  the 
(n+j)-th  dwells.  Here,  n  is  much  smaller  than  the  period  of  the  hopping  patterns,  so  that  the 
patterns  have  a  good  deal  of  substructure.  Since  collisions  are  often  detectable  (in  fact,  the 
good  performance  of  many  of  the  error-control  schemes  is  predicated  on  the  perfect  detection 
of  collisions  and  the  erasure  of  the  corresponding  transmitted  symbols),  the  receiver  knows 
where  collisions  are  going  to  occur  in  the  future.  By  examining  multiple  dwells  spaced  n 
dwells  apart,  the  receiver  can  make  a  more  reliable  decision  as  to  whether  these  dwells  have 
collisions  in  them.  The  periodicity  of  the  collisions  also  improves  the  speed  of  the  decoder 
in  the  error-control  system.  For  example,  if  an  interleaved  Reed-Solomon  code  of  length  n 
is  being  used  for  erasures-and-errors  decoding,  then  the  erasure  locator  polynomial  need  not 
be  calculated  each  time  because  it  is  the  same  for  each  successive  received  codeword. 

An  alternative  strategy  for  exploiting  the  properties  of  these  frequency-hopped  patterns 
uses  a  feedback  channel  by  means  of  which  the  receiver  can  inform  the  transmitter  of  the 
locations  of  the  collisions.  Armed  with  this  knowledge,  the  transmitter  might  use  adaptive 
transmission  strategies.  For  example,  the  transmitter  might  maintain  radio  silence  during 
the  affected  dwells,  thus  reducing  power  consumption  (as  well  as  reducing  interference  for 
the  other  transmitter.)  Of  course,  such  interruptions  reduce  the  data  throughput,  but  this 
can  be  ameliorated  by  increasing  the  code  rate  -  in  the  absence  of  collisions,  less  powerful 
error  correcting  codes  can  be  used.  Further  details  can  be  found  in  [24]. 

We  have  also  considered  simple  strategies  for  reducing  the  number  of  errors  due  to  colli¬ 
sions  in  a  frequency-hop  communication  system.  One  method  leads  to  a  very  slight  reduction 
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in  data  rate,  but  a  significant  improvement  in  performance  at  the  limit  when  the  error-control 
coding  system  is  faced  the  maximum  number  of  errors  that  it  is  designed  to  correct.  The 
effect  of  this  on  the  overall  performance  of  the  system  has  been  studied. 


3.2.5  Adaptive  Coding 

Coding  and  modulation  schemes  for  direct-sequence  spread-spectrum  (DS-SS)  systems  with 
multiple-access  (MA)  interference  in  a  fading  environment  have  been  considered.  Our  goal 
has  been  to  adapt  the  code  to  different  environments.  Communication  in  an  environment 
with  a  large  amount  of  interference  requires  low  transmission  (code)  rate  while  small  levels 
of  interference  allows  high  rate  transmission.  We  have  considered  the  following  coded  mod¬ 
ulation  systems;  a  conventional  64-ary  orthogonal  code,  a  trellis  coded  Nordstrom-Robinson 
(TCNR)  code,  and  a  trellis  coded  Reed-Muller  (TCRM)  code.  Each  of  these  is  concatenated 
with  a  (63,k)  Reed-Solomon  (RS)  outer  code.  The  performance  measure  of  interest  is  error 
probability  and  throughput.  These  are  evaluated  by  making  a  Gaussian  approximation  on 
the  multiple-access  interference.  Our  results  show,  given  an  error  probability  requirement, 
and  a  certain  data  rate,  a  maximum  number  of  users  for  each  of  the  three  DS-SS  schemes 
can  be  determined.  This  indicates  the  receiver  can  adjust  the  coding  and  modulation  scheme 
to  achieve  the  given  requirement.  In  addition  to  the  above,  two  additional  research  areas 
have  been  explored,  several  new  avenues  of  research  have  begun.  These  include  the  exam¬ 
ination  of  turbo  codes  for  frequency-hopped  spread-spectrum  systems  subject  to  jamming, 
and  the  optimization  of  a  product  code  using  Reed-Solomon  codes  to  frequency-hopped 
spread-spectrum  with  partial-band  jamming.  This  research  work  is  detailed  in  Milcom  ‘97. 

3.2.6  Multi-User  Demodulation  Schemes 

Several  multiuser  demodulation  schemes  for  a  direct-sequence  code-division  multiple  access 
(DS-CDMA)  system  that  provide  soft-outputs  to  enhance  single-user  soft-decision  decoders 
have  been  considered  [32].  First,  we  derived  an  optimum  soft-output  multiuser  estimation 
(OSOME)  algorithm  for  asynchronous  CDMA  channels  with  white  Gaussian  noise  (WGN). 
The  algorithm  generates  a  posteriori  probabilities  (APP)  for  coded  bits  given  the  received 
signal.  This  forward-recursive  algorithm  requires  complexity  and  storage  exponential  in 
the  number  of  users  K  and  linear  in  the  decision  lag  L.  Second,  we  developed  a  reduced- 
computation  suboptimal  version  of  this  algorithm  (SSOME)  which  compares  favorably  in 
complexity  and  storage  with  the  multiuser  soft-output  Viterbi  algorithm  (SOVA).  Further¬ 
more,  we  have  shown  that  the  Bayesian  conditional  decision  feedback  estimation  (BCDFE) 
algorithm  provides  symbol  APP  estimates  that  can  be  considerably  improved  upon  by  a  slight 
modification.  We  derived  the  modified  BCDFE  algorithm  for  whitened  multiuser  channels. 
Simulation  results  of  a  4-user,  convolutionally  coded,  asynchronous  CDMA  system  indicate 
that  the  SSOME  algorithm  with  L=K-1  achieves  a  0.4-0. 5  dB  gain  over  SOVA  with  much 
greater  lag  (and  complexity)  and  keeps  within  0.25  dB  of  the  OSOME  algorithm.  Moreover, 
the  modified  BCDFE  algorithm  provides  a  good  performance/complexity  tradeoff. 
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3.2.7  Decision  Feedback  Sequence  Estimation  for  Unwhitened  ISI  Channels 
with  Applications  to  Multiuser  Detection 

Decision  feedback  sequence  estimation  (DFSE),  which  is  a  reduced-complexity  alternative 
to  maximum  likelihood  sequence  estimation,  can  be  used  effectively  for  equalization  of  inter¬ 
symbol  interference  as  well  as  for  multiuser  detection.  The  algorithm  performs  very  well 
for  whitened  (minimum-phase)  channels.  For  non-minimum-phase  channels,  however,  the 
algorithm  is  not  very  effective.  Moreover,  DFSE  requires  a  noise-whitening  filter,  which  may 
not  be  feasible  to  compute  for  time-varying  channels  like  a  multiuser  DS-CDMA  channel. 
Noise-whitening  is  also  cumbersome  for  applications  that  involVe  bi-directional  equalization 
such  as  the  GSM  system.  In  such  conditions,  it  is  desirable  to  use  the  Ungerboeck  formulation 
for  sequence  estimation  which  operates  directly  on  the  discrete-time  unwhitened  statistic 
obtained  from  conventional  matched  filtering.  Unfortunately,  decision  feedback  sequence 
estimation  based  on  matched  filter  statistics  is  severely  limited  by  untreated  interference 
components.  We  have  identified  the  anti-causal  interference  components  using  an  error 
probability  analysis.  This  has  lead  us  to  a  modified  unwhitened  decision  feedback  sequence 
estimator  (MUDFSE)  where  the  components  are  canceled  using  tentative  decisions.  We  have 
obtained  approximate  error  probability  bounds  for  the  proposed  algorithm.  Performance 
results  indicate  that  the  modified  algorithm  used  on  unwhitened  channels  with  relatively 
prnall  channel  correlations  provides  similar  performance/complexity  tradeoffs  as  the  DFSE 
used  on  the  corresponding  whitened  minimum-phase  channels.  The  algorithm  is  especially 
attractive  for  multiuser  detection  for  asynchronous  direct  sequence  CDMA  channels  with 
long  spreading  codes,  where  it  can  achieve  near-MLSE  performance  with  exponentially  lower 
complexity. 

3.2.8  Reduced  Trellis  and  Tree  Search  Algorithms:  The  Effect  of  The  Receive 
Filter  and  The  Branch  Metric 

We  have  examined  the  role  of  the  receive  filter  and  the  branch  metric  in  decision  feedback 
sequence  estimation  (DFSE)  and  M-algorithm  receivers.  We  have  considered  a  generalized 
receiver  which  consists  of  a  front-end  filter  (matched  to  the  overall  channel  response  or  the 
transmit  filter  response)  followed  by  a  general  transversal  processing  filter  and  a  reduced 
trellis  or  tree  search  algorithm.  A  first  event  error  analysis  of  the  generalized  receiver  has 
indicated  that  a  proper  combination  of  the  processing  filter  and  the  branch  metric  is  necessary 
to  avoid  bias  (untreated  interference  components)  in  the  receiver.  Bias  occurs  in  a  DFSE  or 
M-algorithm  receiver  due  to  a  mismatch  between  the  processing  filter  and  the  branch  metric. 
We  have  shown  that  the  processing  filter  must  either  be  the  appropriate  noise-whitening 
filter  or  the  zero-forcing  filter  in  order  to  achieve  unbiasedness.  The  branch  metric  for  the 
reduced  trellis  and  tree  search  algorithms  for  each  case  of  matched  filtering  has  been  obtained. 
The  well-known  DFSE  and  M-algorithm  receivers,  where  the  search  algorithm  operates  on 
whitened  statistics,  belong  to  the  class  of  unbiased  receivers,  while  the  receivers  where  the 
search  algorithm  operates  on  standard  matched  filter  statistics  belong  to  the  class  of  biased 
receivers.  We  have  characterized  the  error  performance  of  various  DFSE  and  truncated 
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memory  MLSE  receivers  in  terms  of  a  parameter  called  the  error  distance,  which  includes 
the  effect  of  noise  enhancement.  We  have  shown  that  the  whitening  filter  DFSE  receiver  is 
optimum  among  unbiased  DFSE  and  truncated  memory  MLSE  receivers  (with  pre-filtering) 
in  the  sense  that  it  maximizes  the  error  distance.  We  have  obtained  upper  bounds  on  the 
symbol  error  probability  of  the  various  DFSE  receivers  and  outlined  a  generating  function 
approach  to  evaluate  the  bounds.  Simulation  and  analytical  results  have  been  obtained  for 
the  case  of  a  symbol-sampled  channel  model  and  a  fractionally-sampled  channel  model. 

3.2.9  A  New  Fractional  MLSE  Receiver  for  Time-varying  Channels 

We  have  derived  a  new  maximum-likelihood  sequence  estimation  (MLSE)  receiver  that  op¬ 
erates  on  fractionally-spaced  samples  obtained  at  the  output  of  an  analog  filter  matched  to 
the  transmit  pulse-shaping  filter.  A  fractional  MLSE  receiver  was  proposed  by  Hamied  et 
al.  that  requires  a  fixed  noise-whitening  filter  to  whiten  noise  samples  at  the  output  of  the 
transmit  matched  filter.  Due  to  the  presence  of  nulls  in  the  Nyquist  bandwidth  of  practical 
pulse-shaping  filters  like  the  square-root  raised  cosine  filter,  the  noise-whitening  filter  has  a 
long  slowly-damped  delay  response  and  any  practical  length  truncation  results  in  significant 
distortion.  The  new  receiver  does  not  need  noise-whitening.  The  branch  metric  of  the  Viterbi 
algorithm  in  the  receiver  accounts  for  the  correlation  in  the  noise  samples.  The  receiver  is 
insensitive  to  sampler  timing  phase.  An  adaptive  form  of  the  receiver  requires  only  one  step 
prediction  of  medium  response  coefiicients.  These  features  make  the  receiver  attractive  for 
fast  time- varying  channels. 

3.2.10  Combined  Decision-Feedback  Multiuser  Detection/Soft-Decision  Decod¬ 
ing  for  CDMA  Channels 

We  have  evaluated  the  performance  of  decision-feedback  multiuser  detectors  with  coding 
for  CDMA  channels.  A  combined  multiuser  detection/soft-decision  decoding  scheme  is  pro¬ 
posed  which  allows  total  control  over  the  reliability  of  decision-feedback  mechanism.  Its 
performance  is  compared  to  that  of  conventional  detection  and  linear-decorrelation  detec¬ 
tion  with  soft-decision  decoding.  Simulation  and  analysis  is  undertaken  for  synchronous 
CDMA  channels.  It  is  found  that  the  scheme  works  very  well  especially  for  weak  users  and 
when  multiuser  interference  is  large.  However,  strong  users  do  not  gain  much  with  this 
scheme  over  a  linear-decorrelation  detector. 

3.2.11  Soft-Output  Multiuser  Estimation  for  Asynchronous  CDMA  Channels 

We  have  considered  several  multiuser  demodulation  schemes  for  a  direct-sequence  code¬ 
division  multiple  access  (DS-CDMA)  system  that  provide  soft-outputs  to  enhance  single-user 
soft-decision  decoders.  First,  we  have  derived  an  optimum  soft-output  multiuser  estimation 
(OSOME)  algorithm  for  asynchronous  CDMA  channels  with  white  Gaussian  noise  (WGN). 
The  algorithm  generates  a  posteriori  probabilities  (APP)  for  coded  bits  given  the  received 
signal.  This  forward-recursive  algorithm  requires  complexity  and  storage  exponential  in  the 


27 


number  of  users  K  and  linear  in  the  decision  lag  L.  Second,  we  have  outlined  a  reduced- 
computation  suboptimal  version  of  this  algorithm  (SSOME)  which  compares  favorably  in 
complexity  and  storage  with  multiuser  soft-output  Viterbi  algorithm  (SOVA).  Furthermore, 
we  have  noted  that  the  Bayesian  conditional  decision  feedback  estimation  (BCDFE)  algo¬ 
rithm  [?]  provides  symbol  APP  estimates  that  can  be  considerably  improved  upon  by  a 
slight  modification.  The  modified  BCDFE  algorithm  for  whitened  multiuser  channels  has 
been  derived.  Finally,  simulation  results  of  a  4-user,  convolutionally  coded,  asynchronous 
CDMA  system  have  been  obtained.  The  results  indicate  that  the  SSOME  algorithm  with 
L  =  AT  -  1  achieves  a  0.4  -  0.5  dB  gain  over  SOVA  with  much  greater  lag  (and  complex¬ 
ity)  and  keeps  within  0.25  dB  of  the  OSOME  algorithm.  Moreover,  the  modified  BCDFE 
algorithm  provides  a  good  performance/complexity  tradeoff. 

3.2.12  Decision  Feedback  Sequence  Estimation  for  Unwhitened  ISI  and  Mul¬ 
tiuser  CDMA  Channels 

Decision  feedback  sequence  estimation  (DFSE),  which  is  a  reduced-complexity  alternative 
to  maximum  likelihood  sequence  estimation,  performs  very  well  for  whitened  (minimum- 
phase)  inter-symbol  interference  (ISI)  channels.  However,  it  may  not  be  feasible  to  compute 
the  whitening  filter  for  many  time-varying  channels  like  the  multiuser  asynchronous  direct- 
sequence  CDMA  channel.  Moreover,  the  performance  of  DFSE  is  not  very  good  for  many 
non-minimum-phase  whitened  ISI  channels  and  for  channels  for  which  it  is  desirable  to 
perform  bidirectional  equalization  as  in  the  GSM  system.  In  these  scenarios,  it  is  beneficial 
to  use  the  Ungerboeck  formulation  for  sequence  estimation  which  operates  directly  on  the 
discrete-time  unwhitened  statistic  obtained  from  conventional  matched  filtering.  We  have 
shown  that  conventional  decision-feedback  sequence  estimation  in  the  Ungerboeck  form  is 
interference  limited.  An  error  probability  analysis  has  led  us  to  a  modified  unwhitened 
decision  feedback  sequence  estimator  (MUDFSE).  Simulation  results  have  indicated  that 
the  modified  algorithm  used  on  unwhitened  ISI  channels  with  low  to  moderate  dispersion, 
provides  similar  performance/complexity  tradeoffs  as  the  DFSE  used  on  the  corresponding 
whitened  TninimuTn-phcise  channels.  The  algorithm  is  especially  attractive  for  multiuser 
detection  for  asynchronous  DS-CDMA  channels  with  long  spreading  codes,  where  sizable 
gains  over  the  (unwhitened)  M-algorithm  are  obtained. 

3.2.13  Decision  Feedback  Sequence  Estimation  for  Unwhitened  ISI  Channels 

A  form  of  decision-feedback  sequence  estimation  (DFSE)  has  been  examined  for  unwhitened 
inter-symbol  interference  (ISI)  channels.  DFSE,  which  is  a  reduced-complexity  alternative  to 
maximum  likelihood  sequence  estimation,  performs  very  well  on  whitened  (minimum-phase) 
ISI  channels.  However,  it  may  not  be  feasible  to  compute  the  whitening  filter  for  many 
time-varying  channels  like  the  multiuser  asynchronous  direct-sequence  CDMA  channel.  On 
such  channels,  it  is  desirable  to  use  the  Ungerboeck  formulation,  which  operates  directly  on 
discrete-time  unwhitened  statistic  obtained  from  conventional  matched  filtering.  Unfortu¬ 
nately,  decision-feedback  sequence  estimation  for  unwhitened  channels  is  interference  limited 
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because  the  decision  statistic  depends  on  both  past  and  future  inputs.  Specifically,  the  de¬ 
cision  rule  that  chooses  the  path  with  the  best  accumulated  metric  leading  to  a  reduced 
state,  fails  to  account  for  interference  from  the  future  inputs  not  included  in  the  reduced 
state.  We  have  proposed  a  modified  decision  rule  to  be  integrated  into  the  Ungerboeck  form 
of  sequence  estimation  which  is  not  sensitive  to  channel  phase.  It  can  be  implemented  as  a 
single  stage  or  a  multistage  scheme  depending  on  the  amount  of  dispersion  present  in  the 
channel.  Simulation  results  have  indicated  that  the  algorithm  used  on  unwhitened  chan¬ 
nels  with  low  to  moderate  dispersion,  provides  similar  performance/complexity  tradeoffs  as 
the  DFSE  used  on  the  corresponding  whitened  minimum-phase  channels.  The  algorithm 
is  especially  attractive  for  multiuser  asynchronous  DS-CDMA  channels  with  long  spreading 
codes. 

3.2.14  A  Family  of  Soft-Output  Multiuser  Demodulators  for  Coded  Asynchronous 
CDMA  Channels 

We  have  considered  a  soft-output  multiuser  demodulation  for  asynchronous  CDMA  channels 
with  coding  and  interleaving.  First,  we  derive  an  optimum  soft-output  multiuser  estima¬ 
tion  (OSOME)  algorithm  for  linearly-modulated  Gaussian  CDMA  channels.  The  forward- 
recursive  algorithm  generates  a  posteriori  probabilities  (APP)  for  modulator-input  symbols 
given  the  received  signal,  which  can  be  fed  directly  to  single-user  soft-decision  decoders.  Sec¬ 
ond,  we  have  outlined  a  reduced-state  soft-output  multiuser  estimation  (RSOME)  scheme, 
with  complexity  exponential  in  a  parameter  called  memory  order  chosen  arbitrarily  (smaller 
than  channel  memory).  These  algorithms  can  be  employed  with  conventional  matched  fil¬ 
tering  and  do  not  require  noise  whitening.  Finally,  we  have  obtained  simulation  results  of 
a  four-user  coded  asynchronous  CDMA  system.  The  results  indicate  that  the  OSOME  al¬ 
gorithm  achieves  a  0.4  —  0.6  dB  gain  over  soft-output  Viterbi  algorithm  (SOYA)  in  a  power 
controlled  environment  and  the  reduced-state  schemes  provide  a  desirable  performance- 
complexity  tradeoflF. 

3.2.15  Type-II  Hybrid  ARQ  Protocol  with  Adaptive  Coding  and  Spread-Spectrum 

It  is  desirable  to  utilize  information  in  packets  with  errors  whenever  possible.  For  example, 
if  a  packet  is  received  with  one  more  bit  than  the  utilized  error  control  code  can  correct,  a 
packet  with  extra  parity  information  can  be  sent  to  correct  that  error  so  that  the  original 
packet  is  useful.  When  the  channel  is  good,  extra  parity  information  does  not  need  to  be 
sent.  However,  when  the  channel  degrades,  the  extra  parity  is  needed.  Adaptive  coding 
is  achieved  by  sending  fewer  packets,  and  hence  parity  bits,  when  the  channel  is  good  and 
sending  more  parity  packets  when  the  channel  degrades. 

We  have  applied  the  type-II  hybrid  ARQ  protocol  in  a  slotted  DS-SSMA  packet  radio 
system  [30].  Using  the  improved  Gaussian  approximation  technique,  we  have  performed 
static  analysis  on  such  a  system  to  obtain  packet  success  probabilities.  Moreover,  we  have 
employed  two-dimensional  Markov  chains  to  analyze  the  dynamic  behavior  of  the  system. 
Based  on  this  model,  we  have  upper  and  lower  bounded  the  performance  of  the  type-II 
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hybrid  ARQ  protocol  by  considering  a  superior  scheme  and  an  inferior  scheme,  respectively. 

We  have  obtained  the  steady  state  throughput  and  delay  performances  and  the  optimal 
retransmission  probabilities  of  the  two  bounding  schemes.  Finally,  using  a  sample  system  as 
an  example,  we  have  shown  how  to  derive  the  design  criteria  for  the  transmitter  buffers  in 
order  to  achieve  the  system  capacity  with  minimum  delay. 

3.2.16  PAM/QAM  Signal  Sets  for  Trellis-Coded  Spread-Spectrum  Systems 

The  same  effect  as  is  achieved  by  adaptive  coding  can  be  achieved  by  utilizing  the  structure 
of  modulation  schemes  such  as  quadrature  amplitude  modulation  (QAM).  When  the  channel 
is  good,  a  decision  is  made  among  all  the  points  in  the  signal  constellation  corresponding  to 
the  modulation,  and  when  the  channel  is  bad,  a  decision  is  made  among  a  subset  of  the  points 
in  that  constellation.  In  order  to  ascertain  the  benefits  of  high  dimensional  modulations  such 
as  QAM,  the  performance  of  coded  QAM  schemes  in  multiple-access  interference  must  be 
determined. 

An  expression  in  matrix  form  for  the  multiple-access  interference  (MAI)  in  an  asyn¬ 
chronous  direct-sequence  spread-spectrum  multiple-access  communication  system  with.cas- 
caded  sequences,  arbitrary  chip  waveforms,  and  trellis-coded  modulation  (TCM)  with  a  PAM 
or  QAM  signal  set,  has  been  obtained.  TCM  provides  significant  coding  gain,  while  cascaded 
sequences  decrease  the  correlation  between  the  MAI  of  adjacent  data  intervals.  The  expres¬ 
sion  has  been  used  to  calculate  arbitrarily  accurate  probability  density  functions  (pdf  s)  of 
the  MAI  in  the  TCM  system  and  to  derive  an  accurate  approximation  of  the  MAI  variance. 

This  has  also  helped  to  illustrate  some  properties  of  the  MAI  by  separating  contributing 
parameters  into  different  matrices.  We  have  derived  an  approximation  of  the  upper  union 
bound  on  the  bit  error  probability  and  investigated  its  applicability.  The  results  [33]  show 
that  cascaded  sequence  schemes  can  greatly  reduce  the  pairwise  error  probabilities  until  the 
length  of  the  cascaded  sequence  becomes  greater  than  that  of  the  error  weight  sequence 
under  consideration. 

3.2.17  Pairwise  Error  Probabilities  in  Coded  Spread-Spectrum  Systems  in  Rayleigh 
Fading  Channels 

Radio  links  encountered  on  the  digital  battlefield  are  typically  coded  and  suffer  from  fading. 

In  order  to  assess  spread-spectrum  links  in  such  an  environment,  it  is  necessary  to  focus  on  the 
channel  state  information  (CSI)  that  is  available  and  also  on  the  pairwise  error  probabilities 
between  codewords  or  paths  through  a  trellis. 

Arbitrarily  tight  upper  and  lower  bounds  on  the  pairwise  error  probability  (PEP)  of  a 
trellis-coded  or  convolutional-coded  direct-sequence  spread-spectrum  multiple-access  (DS/SSMA) 
communication  system  over  a  Rayleigh  fading  channel  have  been  derived  [34].  A  new  set 
of  probability  density  functions  (pdf’s)  and  cumulative  distribution  functions  (cdf’s)  of  the 
multiple-access  interference  (MAI)  statistic  has  been  derived,  and  a  modified  bounding  tech¬ 
nique  has  been  proposed  to  obtain  the  bounds.  The  upper  bounds  and  lower  bounds  together 
specify  the  accuracy  of  the  resulting  estimation  of  the  PEP,  and  give  an  indication  of  the 
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system  error  performance  with  imperfect  channel  state  information  (CSI).  Several  subop¬ 
timum  decoding  schemes  have  been  proposed  and  their  performances  have  been  compared 
to  that  of  the  optimum  decoding  scheme  by  the  average  pairwise  error  probability  (APEP) 
values.  The  approach  can  be  used  to  accurately  study  the  multiple-access  capability  of  the 
coded  DS/SSMA  system  without  numerical  integrations. 

3.2.18  Blind  adaptation  for  interference  suppression  in  DS-CDMA 

We  have  provided  a  method  [25]  for  blind  spatiotemporal  adaptation  for  interference  sup¬ 
pression  in  direct-sequence  CDMA  with  short  spreading  sequences. 


3.2.19  Pipelined  architectures  for  adaptive  interference  suppression  algorithms 

We  have  provided  pipelined  architectures  for  high-speed  low-power  implementations  of  adap¬ 
tive  interference  suppression  algorithms  for  direct-sequence  CDMA  [21]. 
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3.3  Antenna  Array  Processing  (Michael  Zoltowski,  Purdue  Univ.) 

The  goal  of  this  research  has  been  to  develop  space-time  RAKE  receivers  for  direct-sequence 
spread-spectrum  communication  systems  that  achieve  two  primary  goals:  (1)  optimal  combi¬ 
nation  of  the  desired  user’s  multipath  in  a  RAKE-like  receiver  fashion  and  (2)  simultaneous 
cancellation  of  strong  multiple-access  interference  and  jammers.  The  effort  has  concentrated 
on  the  development  and  analysis  of  reduced  dimension  space-time  RAKE  receivers  that  yield 
the  following  two  advantages:  (1)  reduced  computational  complexity  and  (2)  faster  conver¬ 
gence  to  near  optimum  SINR  in  the  symbol  decision  statistic.  The  effort  has  proceeded  along 
the  following  lines: 

•  Convergence  Analysis  and  Tracking  Ability  of  Reduced  Dimension  Blind  Space-Time 
Rake  Receivers  for  DS-CDMA 

•  Joint  Angle  and  Delay  Estimation  for  Reduced  Dimension  Space-time  Rake  Receiver 
With  Application  to  IS-95  CDMA  Uplink 

•  Blind  2-D  RAKE  Receivers  Based  on  RLS-type  Space-Time  Adaptive  Filtering  for 
DS-CDMA  System 

Each  of  these  three  efforts  is  briefly  summarized  below. 

3.3.1  Convergence  Analysis  and  Tracking  Ability  of  Reduced  Dimension  Blind 
Space-Time  Rake  Receivers  for  DS-CDMA 

We  have  examined  reduced  dimension  blind  space-time  RAKE  receivers  for  classical  DS- 
CDMA  that  cancel  strong  multiple-access  interference  while  optimally  combining  the  desired 
user’s  multipath.  After  passing  the  output  of  each  antenna  through  a  matched  filter,  whose 
impulse  response  is  an  oversampled  version  of  the  time-reverse  of  the  spreading  waveform 
of  the  desired  user,  one  estimates  the  reduced  dimension  signal  plus  interference  space-time 
correlation  matrix  during  that  portion  of  the  bit  interval  where  the  RAKE  fingers  occur,  and 
the  reduced  dimension  interference-alone  space-time  correlation  matrix  during  that  portion 
of  the  bit  interval  away  from  the  fingers.  It  was  shown  that  the  weight  vector  yielding  the 
optimum  signal-to-interference-plus-noise  ratio  for  bit  decisions  is  the  largest  generalized 
eigenvector  of  the  resulting  matrix  pencil. 

In  this  work,  we  have  compared  the  asymptotic  SINR  and  convergence  rate  of  full  dimen¬ 
sion  space-time  processing  with  that  of  reduced  dimension  processing  based  on  simultaneous 
time  and  space  compressing  transformations.  We  have  shown  through  theoretical  analy¬ 
sis  and  supporting  simulations  that  if  the  space  and  time  compressing  transformations  are 
designed  judiciously,  reduced  dimension  2D  RAKE  receivers  offer  faster  convergence  with 
respect  to  output  SINR  as  a  function  of  the  number  of  symbols  received  than  that  achieved 
with  a  full  dimension  space-time  RAKE  receiver,  at  the  expense  of  only  a  slight  loss  in 
asymptotic  SINR.  Applications  to  the  IS-95  uplink  were  developed  along  with  supporting 
simulations.  In  addition,  simulations  involving  a  mobile  moving  75  mph  were  conducted 
demonstrating  the  tracking  capabilities  of  the  reduced  dimension  2D  RAKE  receiver. 
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3.3.2  Joint  Angle  and  Delay  Estimation  for  Reduced  Dimension  Space-time 
Rake  Receiver  With  Application  to  IS-95  CDMA  Uplink 

We  have  developed  an  algorithm  for  joint  estimation  of  the  angle  of  arrival  (AOA)  and  delay 
of  each  dominant  multipath  for  the  desired  user  for  use  in  a  reduced  dimension  space-time 
RAKE  receiver  for  DS-CDMA  communications  that  is  near-far  resistant.  After  estimat¬ 
ing  the  desired  spatio-frequency  signal  vector,  we  have  proposed  the  2D  unitary  ESPRIT 
algorithm  as  our  estimator,  which  provides  closed-form  as  well  as  automatically  paired 
AOA-delay  estimates.  We  effectively  have  a  single  snapshot  of  2D  data  and  thus  require 
2D  smoothing  for  extracting  multiple  snapshots.  The  comparative  performance  of  two  2D 
smoothing  schemes,  called  pre-eigenanalysis  and  post-eigenanalysis  2D  smoothing,  respec¬ 
tively,  has  been  examined.  The  space-time  data  model  for  the  IS-95  uplink  was  investigated. 
The  performance  of  a  reduced  dimension  space-time  RAKE  receiver  for  the  IS-95  uplink 
using  the  AOA-delay  estimates  has  been  assessed  through  Monte-Carlo  simulations. 

Simulation  results  show  that  our  algorithms  can  accurately  estimate  the  AOA  and  delay 
for  each  multipath  under  near-far  conditions.  The  AOA  information  is  useful  for  FDD 
downlink  beamforming  and  source  localization  for  an  emergency  geolocation  (911)  service. 
The  pre-eigenanalysis  2D  smoothing  algorithm  offers  better  performance  than  the  post- 
eigenanalysis  2D  smoothing  algorithm.  The  paired  AOA  and  delay  information  may  be 
used  to  the  reduce  dimensionality  of  the  space-time  2D  RAKE  receiver.  Our  simulation 
results  show  that  better  separation  between  the  value  of  the  true  Walsh  symbol  decision 
variable  and  the  other  63  decision  variables  may  be  achieved  by  using  the  knowledge  of  the 
paired  angle-delay  estimates,  as  opposed  to  that  obtained  with  full  dimension  space-time 
processing.  This  better  performance  results  from  the  faster  convergence  toward  the  optimal 
weight  vector  due  to  the  reduced  dimensionality  under  conditions  of  limited  available  symbols 
where  channel  characteristics  remain  approximately  stationary.  The  simulation  results  have 
revealed  that  the  performance  of  the  joint  AOA-delay  estimator  and  the  resulting  reduced 
space-time  RAKE  receiver  is  quite  promising.  Work  has  included  examination  of  the  choice 
of  subarray  size  employed  in  2D  smoothing  and  a  comparison  of  the  performance  of  the  joint 
AOA-delay  estimator  to  the  CRB(Cramer-Rao  Bound). 

3.3.3  Blind  2-D  RAKE  Receivers  Based  on  RLS-type  Space-Time  Adaptive 
Filtering  for  DS-CDMA  System 

We  have  developed  a  blind  2D  RAKE  receiver  for  CDMA  that  cancels  strong  multi-user 
access  interference  and  optimally  combines  multipath.  The  weight  vector  yielding  the  opti¬ 
mum  signal-to-interference-plus-noise  ratio  for  bit  decisions  is  the  largest  generalized  eigen¬ 
vector  of  the  spatio-frequency  (spatio-temporal)  correlation  matrix  pencil.  However,  the 
eigen-analysis  based  algorithm  is  on  the  order  of  0{N^)  computational  complexity  and  the 
resulting  spatio-frequency  (spatio-temporal)  correlation  matrix  pencil  is  of  large  dimension. 
This  detracts  from  the  real-time  applicability  of  that  scheme.  A  blind  2-D  RAKE  receiver 
was  thus  developed  based  on  an  RLS-type  space-time  adaptive  filtering  scheme  which  offers 
0{N^)  computational  complexity  and  competitive  performance.  The  applicability  of  the 
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scheme  to  the  IS-95  uplink  has  also  been  addressed  in  a  decision-directed  fashion. 
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4  Integration  of  Research  and  Technology  Transfer 


Attempts  have  been  made  to  obtain  other  awards  and  to  continue  and  expand  indus¬ 
trial  and  university  cooperation  in  order  to  accomplish  technology  transfer.  This  activity 
has  occurred  because  of  our  notification  that  support  has  ended  for  the  Focused  Research 
Programs  across  the  country. 
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3.2  The  Welch  Bounds 


It  was  shown  in  [18]  that 


Y,  =  Y  9k{m){gi{m))* .  (11) 

m£^A  m£M. 


holds  for  the  aperiodic,  periodic,  and  odd  correlation  functions.  More  recently, 
it  has  been  noted  [28]  that  the  same  proof  shows  that  polyphase  correlation 
functions  also  satisfy  this  identity.  Thus,  as  described  in  [21],  it  follows  from 
(11)  that 


YYY  =  Y 

k=ll=lm£M  m£M* 


k=l 


(12) 


Thus  the  following  generalization  of  Proposition  1  holds  [16],  [21],  [28]. 


Proposition  16.  The  aperiodic  and  polyphase  correlation  functions  (which  in¬ 
clude  the  periodic  and  odd  correlation  functions  as  special  cases)  for  a  signal  set 
satisfy  the  following  lower  bound: 

EE  E  a 

k=l  1^1  meM 

where  equality  holds  in  (13)  if  and  only  if 
K 

=  0  for  all  m  E  M*  •  (^-4) 

^=1 


Since  the  left  side  of  (13)  includes  K  in-phase  autocorrelations  with  total  value 
the  Welch  bounds  on  the  RMS  correlation  are  as  follows: 

INK-N 

while  the  more  familiar  form  is 

INK-N 

^1"“  -  V  ^  MK  -  1  •  ^ 

The  bounds  for  the  polyphase  correlations  (including  periodic  and  odd  correla¬ 
tions)  are  slightly  larger  than  the  right  side  of  (4)  but  are  nonetheless  still  slightly 
smaller  than  y/N.  The  bounds  for  aperiodic  correlation  are  slightly  smaller  thaji 
y/N/2  since  M  =  2N  -  1  in  this  case. 


Welch  obtained  the  bounds  (15)  and  (16)  for  the  periodic  and  aperiodic 
correlation  functions  as  a  direct  consequence  of  the  inner  product  bound  (3). 
Thus,  in  [33],  it  was  shown  that  for  periodic  and  aperiodic  correlations  functions, 

k=l  l=l  m€M 

Mow  [15]  has  shown  that  a  straightforward  modification  of  Welch’s  argument 
proves  (17)  for  odd  correlation  functions  as  well,  and  a  further  simple  modifica¬ 
tion  shows  that  (17)  holds  for  polyphase  correlations  as  well. 


1 


M  +  5  ■ 
s 


(17) 


3.3  Complementary  Signal  Sets 

Sets  of  signals  satisfying  the  relation 

K 

C/c(m)  =  0  for  all  m  /  0  (18) 

k=l 


are  called  complementary  signal  sets  [5],  [22],  [30]. 

Proposition  17.  The  aperiodic  and  polyphase  correlation  functions  (which  in¬ 
clude  the  periodic  and  odd  correlation  functions  as  special  cases)  for  a  comple¬ 
mentary  signal  set  satisfy 


K  K 

k=ll=l  m 

and  thus  meet  the  Welch  bound  (15)  on  RMS  correlation  with  equality. 

Various  designs  for  complementary  signal  sets  are  given  in  [5],  [22],  and  [30]. 
However,  in  hindsight,  it  seems  that  it  is  really  very  easy  to  construct  a  comple¬ 
mentary  signal  set.  The  author  did  not  find  an  explicit  reference  to  cite  for  the 
following  construction,  but  its  elementary  nature  suggests  that  it  has  undoubt¬ 
edly  been  noted  by  several  researchers  already. 

Proposition  18.  Every  WBE  signal  set  is  a  complementary  signal  set. 

Proposition  18  and  the  results  of  Section  II  imply  that  a  surprisingly  large  num¬ 
ber  of  signal  sets  are  actually  complementary  signal  sets.  For  example,  almost  all 
“interesting”  codes  and  their  cosets  are  complementary  signal  sets.  As  a  special 
case,  it  was  noted  in  [22]  that  irreducible  cyclic  codes  and  their  cosets  are  com¬ 
plementary  signal  sets,  and  in  particular,  the  expurgated  set  of  Gold  sequences 
of  Proposition  12  is  a  complementary  signal  set. 


4  WBE  Signal  Sets  in  Asynchronous  CDMA  Systems 


4.1  The  Signal-to-Noise  Ratio  Parameters 

The  signal-to-noise  ratio  (SNR)  for  asynchronous  binary  CDMA  communication 
systems  was  defined  by  Pursley  [17]  and  has  been  the  subject  of  considerable 
study  since.  Let 

^lkAn)=  E  Ck,imck.iQ+n)r=  E  (i9) 

i=i^N  i=i-Ar 

where  the  second  equality  follows  from  a  generalization  of  (11)  (see,  e.g.  Eq. 
(5.24)  in  [24].)  In  [17]  it  is  shown  that  the  variance  of  the  multiple- access  inter¬ 
ference  in  a  receiver  that  is  synchronized  to  the  z-th  signal  is  proportional  to  the 
SNB  parameter 


n  =  2/z/c,t(0)  +  Re[/z;c^f(l)]  Prf  Ri  —  2///e,,'(0). 

kiii 


Now  suppose  that  the  set  of  signature  sequences  is  a  WBE  signal  set  or,  more 
generally,  a  complementary  signal  set.  Then,  from  (18),  (19),  and  the  property 
Ci{—1)  =  [C, •(/)]*  of  the  aperiodic  autocorrelation  function,  it  follows  that 


N-l 


r<=2(A-l)iV2-4^|C'.(/)|2 

+2iK  -  2)ARe[a(l)]  -  2Re  (  ^  Ciil)[Ciil  +  1)]*  )  (20) 


^JV-2 

E 


and  thus 

iV-l 

J?i  =  2(A-l)N2-4  53|Ci(/)p.  (21) 

l=l 


The  sum  |C,(0P  (21)  is  called  the  sidelobe  energy  of  the  signal  x(’)  and  is 

nonzero  (except  for  TDM  A  signal  sets).  Hence,  R,-  <  2{K  —  1)N^  (and  generally, 
r,-  <  2{K  -  1)A^  also)  for  all  interesting  WBE  signal  sets.  Consider  instead  a 
CDMA  communication  system  in  which  the  K  signature  sequences  are  chosen 
via  random  sampling  (with  replacement)  from  the  (N,  N)  code  as  in  Section 
II.J.  Then,  both  R,-  and  r<  have  expected  value  2{K  -  l)N^.  Thus,  since  each 
Ri  is  smaller  than  2(/<'  -  1)A^  for  WBE  signal  sets,  these  signal  sets  appear  to 
be  somewhat  better  than  random  sequences  with  respect  to  this  measure  of  per¬ 
formance.  Another  curious  aspect  of  (21)  is  that  the  larger  the  sidelobe  energy, 
the  smaller  the  variance  of  the  multiple-access  interference.  Signals  with  small 
sidelobe  energy  are  usually  preferred  as  signature  sequences  in  CDMA  commu¬ 
nication  systems  on  the  grounds  that  such  signals  perform  better  over  multipa.th 


channels  and  are  easy  to  acquire.  Hence,  considerable  effort  has  been  expended 
in  the  past  on  searches  for  signals  with  small  sidelobe  energy.  Paradoxically, 
Eq.  (21)  implies  that  from  the  point  of  view  of  multiple-access  interference,  it 
would  have  been  better  if  the  searches  had  tried  to  find  signals  with  the  largest 
sidelobe  energy  instead  of  the  smallest! 

In  most  instances,  Ri  or  Vi  must  be  determined  via  numerical  evaluation  of 
the  autocorrelation  function  values  needed  in  (20)  or  (21).  It  has  already  been 
shown  that  Ri  <  2{K  —  1)7V’^  for  complementary  signal  sets,  but  it  is  not  obvious 
without  computation  whether  Vi  <  —  1)7\^^  also.  In  some  cases,  however,  it 

is  possible  to  state  more  specific  results. 

-  The  complementary  pairs  of  binary  sequences  described  by  Taki  et  al  [29] 
have  the  property  that  Ck,i{l)  =  0  for  all  even  values  of  /.  It  follows  from 
(19)  that  Vi  =  Ri  <  2{K  —  l)N^  for  any  collection  of  such  complementary 
pairs  of  sequences, 

-  If  the  sequence  alternately  takes  on  real  and  imaginary  values,  then  Ci{l) 
is  real  for  even  values  of  I  and  imaginary  for  odd  values  of  /.  It  follows  that 
Re[Ci{l)Q{l  +  1)]  =  0  and  hence  n  =  Ri  <  2{K  -  1)N^.  Of  course,  the 
signal  set  must  be  a  WBE  signal  set  (or  complementary  signal  set)  for  this 
to  hold,  but  this  requirement  is  easily  satisfied.  If  alternate  columns  of  the 
signal  matrix  of  a  real- valued  WBE  signal  set  X  are  multiplied  by  dij,  then, 
according  to  Proposition  3,  the  resulting  signal  set  X  is  also  a  WBE  signal 
set.  In  particular,  if  A'  is  a  binary  WBE  signal  set,  then  X  is  a.  WBE  signal 
set  over  the  quaternary  alphabet  {+1,  — 1, -|-i,  — i}-  Such  quaternary  signals 
are  a  special  case  of  the  EOE  sequences  described  by  Fukumasa,  Kohno,  and 
Imai  [4]  and  can  also  be  thought  of  as  being  derived  from  binary  sequences 
via  the  elementary  Method  B  of  [9]. 

4.2  SNR  Parameters  for  Some  Cyclic  WBE  Signal  Sets 

Many  of  the  constructions  of  WBE  signal  sets  of  length  N  in  Section  II  are  based 
on  cyclic  codes  of  length  N.  The  signature  sequences  used  in  CDMA  applications 
are  often  required  to  have  full  period  AT,  and  WBE  signal  sets  of  this  type  are 
considered  in  the  next  result. 

Proposition  19.  If  a  WBE  signal  set  X  is  either  a  coset  or  a  union  of  cosets 
of  a  linear  cyclic  code  C  all  of  whose  nonzero  codewords  have  period  N ,  then, 

^\Ciil)\^  =  K{N-l),  1<1<N-1,  (22) 

i=l 

R(X)  =  ^y]Ri  =  2(/<  -  1)N^  -  2N^  +  2N,  (23) 

and  f{X)  =  <2{K  -  1)N^  -  -  N  +  2. 


(24) 


4.3  Optimal  Shifts  of  WBE  Signal  Sets 

Since  R{X)  =  2{K  -  l)N^  -  +  2N  for  a  cyclic  WBE  signal  satisfying  the 

hypotheses  of  Proposition  19,  at  least  one  signal  has  i?,-  no  larger  than  this. 
However,  other  Rk  may  be  somewhat  larger  though  (21)  guarantees  that  each  is 
smaller  than  2{K  -  l)N'^.  Now,  T’^X,  the  m-th  left  cyclic  shift  of  ;f,  is  also  a 
WBE  signal  set  satisfying  the  hypotheses  of  Proposition  19.  But  the  SNR 
parameter  for  need  not  be  the  same  as  ^  —  Rj  [18],  [19],  [24].  Thus,  for 

any  given  t,  one  can  search  for  the  shift  vn,  that  minimizes  Unfortunately, 

since  R{T^X)  =  R{X)  is  fixed,  a  decrease  in  the  SNR  parameter  for  one  signal  is 
always  accompanied  by  an  increase  in  the  SNR  parameter  for  other  signals.  Thus, 
a  better  solution  might  be  to  search  for  that  m  which  minimizes  maxi<i<jf  R,’”  . 

Since  f{X)  <  2{K  -  1)N^  -  iV^  -  IV  +  2  for  the  class  of  WBE  signal  sets 
satisfying  the  hypotheses  of  Proposition  19,  at  least  one  r,-  is  as  small  as  this. 
In  fact,  since  this  bound  is  quite  loose,  one  can  hope  to  find  that  at  least  one 
n  is  as  small  as  R{X)  =  2{K  -  l)N‘^  -  2N^  +  27V.  It  sometimes  happens  that 
all  the  Vi  are  smaller  than  2(/f  —  1)7V^,  but,  in  contrast  to  the  situation  for  Ri, 
there  is  no  guarantee  that  this"  will  always  happen.  As  in  the  previous  paragraph, 
for  any  given  i,  one  can  choose  an  m  that  minimizes  .  Unlike  the  situation 
with  R,  however,  f(T’^X)  general,  so  that  it  is  possible  to  choose  w, 

to  minimize  f{T’^X)  and  obtain  the  best  average  performance,  or  to  minimize 

maXKKif  so  as  to  obtain  the  best  worst-case  performance. 

Thus,  given  a  WBE  signal  set  X  satisfying  the  hypotheses  of  Proposition  19, 
one  can  choose  a  cyclically  shifted  set  T^X  to  minimize  various  SNR  parameters. 
The  performance  improvement  is  not  very  large,  but  is  often  large  enough  to 
make  the  computational  effort  worthwhile.  It  is  also  worth  keeping  in  mind  that 
the  kinds  of  searches  discussed  here  shift  all  the  signals  in  X  by  the  same  tn.  If  the 
shifts  applied  to  the  signals  are  not  the  same,  the  resulting  signal  set  is  no  longer 
a  WBE  signal  set,  and  the  above  results  may  no  longer  apply.  Nonetheless,  it  is 
possible  that  SNR  parameters  might  be  reducible  to  the  same  level  as  those  of  a 
WBE  signal  set  or  even  further.  Indeed,  even  for  non  WBE  signal  sets,  it  is  usually 
possible  to  reduce  the  SNR  parameters  below  2{K-1)N'^  but  the  computational 
effort  in  searching  over  shifts  is  formidable,  and  only  restricted  searches  are 
generally  feasible.  The  advantage  of  beginning  with  a  WBE  signal  set  is  that 
some  measure  of  goodness  of  performance  is  guaranteed  by  the  results  described 
above  without  any  search,  and  the  results  can  be  improved  somewhat  by  a  much 
narrower  search. 


4.4  Subsets  of  WBE  Signal  Sets 

WBE  signal  sets  have  too  many  signals  to  be  of  use  in  practical  CDMA  systems 
since  the  number  of  simultaneous  transmissions  that  can  be  supported  is  gener¬ 
ally  much  smaller  than  the  signature  sequence  length  JV.  On  the  other  hand,  the 
results  proved  above  require  the  cancellations  described  in  (18)  and  these  hold 
only  when  the  entire  set  is  used.  Since  subsets  of  WBE  signal  sets  are  generally 


This  result  was  proved  earlier  for  cosets  of  irreducible  binary  cyclic  codes.  Re¬ 
cently,  it  has  been  proved  for  a  class  of  quaternary  sequences. 

Corollary  19.1.  (Sarwate  [23]):  Proposition  19  holds  for  any  coset  of  an  irre¬ 
ducible  binary  cyclic  code.  In  particular,  it  holds  for  the  coset  which  forms  the 
expurgated  set  of  Gold  sequences  described  in  Proposition  12. 

Corollai-y  19.2.  (Sun  and  Leib  [28]):  Proposition  19  holds  for  A~ ,  an  expur¬ 
gated  quadriphase  signal  set  obtained  from  the  Family  A  described  by  Bozta§, 
Hammons  and  Kumar  [3]. 

Family  is  a  quaternary  supercode  of  a  binary  m-sequence  code,  and  the  signal 
set  A~ ,  which  consists  of  one  nonbinary  signal  from  each  nonbinary  cyclic  equiv¬ 
alence  class  of  .4,  is  a  quaternary  coset  of  the  binary  m-sequence  code.  As  an  ex¬ 
ample  of  this,  note  that  Table  II  of  [3]  lists  the  9  nonzero  cycle  representatives  in  a 
Family  A  of  period  7,  of  which  the  first  is  the  binary  m-sequence  v  =  2002022  and 
the  second  a  quaternary  sequence  u  =  3221211.  Adding  the  cyclic  shifts  of  the 
first  sequence  to  the  second  produces  (cyclic  shifts  of)  the  rest  of  the  sequences  in 
the  Table.  In  other  words,  if  different  cycle  representatives  are  chosen,  the  signals 
in  Table  II  of  [3]  can  be  expressed  as  the  set  {v,  u,  u-\-Tv, ...  ,  u-^T^ v)  while 
the  signal  set  A~  is  {u,  u  -{■  v,u  Tv, ...  ,  u  +  T®!)}.  Thus,  A  is  a  quaternary 
coset  of  the  binary  m-sequence  code  {0,  v,  Tv, . . .  ,  T®v},  and  the  expurgation  is 
remarkably  similar  to  that  used  to  obtain  the  expurgated  set  of  Gold  sequences. 
Sun  and  Leib  [28]  have  also  asserted  with  respect  to  A~  that 
“. . .  it  is  remarkable  that  the  average  user  interference  of  this  family  at 
optimal  phases  outperforms  the  ideal  random  sequences.” 

Actually,  according  to  Proposition  19,  a  very  large  class  of  signal  sets  enjoys 
this  property,  and  the  fact  that  (24)  holds  for  A~  is  not  remarkable  at  all. 
Indeed,  the  property  is  a  result  of  the  underlying  cyclic  structure  and  does  not 
depend  at  all  on  the  coset  representative  -  an  arbitrary  choice  of  quaternary 
sequence  u  in  the  above  would  have  worked  just  as  well!  Nor  is  it  necessary 
to  find  optimal  phases  for  the  family  -  any  choice  of  phase  of  the  quaternary 
sequence  u  gives  the  same  result.  What  is  actually  remarkable  is  that  Proposition 
19  can  be  used  to  construct  a  signal  set  for  which  all  the  SNR  parameters  r,- 
are  smaller  than  2{K  —  1)N^.  The  construction  is  as  follows.  Choose  alternate 
symbols  of  the  coset  representative  from  the  subsets  {0,2}  and  {1,3}  of  Z4 
respectively.  Then  the  signals  are  alternately  real-valued  and  imaginary-valued 
as  described  in  the  previous  subsection.  Hence,  f(A')  =  R{X)  equals  the  right 
side  of  (23).  Furthermore,  each  r,-  =  Ri  <  2{K  -1)N^,  which  is  a  better  result 
than  can  be  guaranteed  via  Proposition  19  for  A~ .  However,  such  a  signal  set 
is  not  necessarily  a  good  choice  for  CDMA  applications.  The  two  corollaries 
above  are  of  importance  because  the  signal  sets  are  interesting  because  of  their 
maximum  correlation  properties  -  the  good  SNR  parameter  properties  are  just  an 
added  attraction.  The  SNR  parameter  properties  by  themselves  do  not  provide 
guidance  in  signal  selection.  Note,  for  example,  that  Proposition  19  also  holds 
for  a  multiset  formed  by  the  union  of  a  coset  with  itself,  but  the  resulting  signal 
set  is  of  no  interest  for  CDMA  applications. 
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Abstraci-This  paper  evaluates  the  performance  of 
decision-feedback  multiuser  detectors  with  coding  for 
CDMA  channels,  A  combined  multiuser  detection/soft- 
decision  decoding  scheme  is  proposed  which  allows  to¬ 
tal  control  over  the  reliability  of  decision-feedback  mech¬ 
anism.  Its  performance  is  compared  to  that  of  con¬ 
ventional  detection  and  linear- decorrelation  detection 
with  soft-decision  decoding.  Simulation  and  analysis 
is  undertaken  for  synchronous  CDMA  channels.  It  is 
found  that  the  scheme  works  very  well  especially  for 
weak  users  and  when  multiuser  interference  is  large. 
However,  strong  users  do  not  gain  much  with  this 
scheme  over  a  linear-decorrelation  detector. 

1  Introduction 

Conventional  detection  for  a  direct  sequence  code¬ 
division  multiple-access  (DS-CDMA)  system  employs 
a  bank  of  matched  filters,  matched  to  the  signature 
waveforms  of  individual  users,  followed  by  decision  de¬ 
vices.  This  scheme  suffers  from  severe  degradation  if 
the  signal  strength  of  some  interfering  users  is  dom¬ 
inant.  The  linear-decorrelation  detector  (LDD)  does 
a  multidimensional  linear  filtering  operation  on  the 
matched  filter  outputs,  which  completely  decorrelates 
the  signal  component  of  each  user  with  those  of  the  in- 
terferering  users.  This  detector  performs  significantly 
better  than  the  conventional  detector.  However,  since 
it  employs  channel  inversion  to  get  rid  of  MAI,  it  en¬ 
hances  noise.  The  maximum  likelihood  sequence  de¬ 
tector  minimizes  the  sequence  error  probability  and 
has  complexity  that  grows  exponentially  with  the  num¬ 
ber  of  users,  making  it  infeasible  for  implementation. 

Several  low- complexity,  sub-optimum  multiuser  de¬ 
tectors  have  been  proposed  in  recent  years  but  most 
of  the  research  done  so  far  on  multiuser  detection  has 
considered  uncoded  transmissions,  with  a  few  excep¬ 
tions  [1].  Error  control  coding,  however  is  important  in 
a  noisy  and  interference  limited  environment  to  obtain 
satisfactory  performance. 

A  family  of  single-stage  decision-feedback  multiuser 
detectors  (DFMD)  has  been  proposed  for  asynchronous 
CDMA  channels  in  [2].  In  these  detectors,  users  make 
decisions  in  the  order  of  decreasing  signal  strength. 
The  MAI  affecting  a  given  symbol  of  a  given  user  is 
classified  as  causal  and  anticausal  depending  on  whether 
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decisions  for  the  symbols  causing  the  interference  are 
available  or  not.  A  DFMD  eliminates  anti-causal  MAI 
using  zero-forcing  filtering  and  attempts  to  cancel  causal 
MAI  using  decision  feedback.  It  is  shown  in  [2]  that 
the  improvement  in  performance  with  a  DFMD  is  ap¬ 
preciable  over  the  LDD,  especially  when  MAI  is  large. 
The  detectors  have  a  nearly  ideal  near-far  resistance. 
Another  desirable  feature  of  these  detectors  is  that 
their  serial  structure  allows  combining  multiuser  de¬ 
tection  with  soft-decision  decoding  for  maximum  ben¬ 
efit.  Note  that  manj  of  the  low- complexity  detec¬ 
tors  proposed  in  recent  years  can  not  easily  be  com¬ 
bined  with  true  soft- decision  decoding.  Examples  in¬ 
clude  the  multistage  detector  proposed  by  Varanasi 
and  Aazhang[3]  which  employs  successive  interference 
cancellation  and  the  post-processing  likelihood  detec¬ 
tor  of  Seshadri  and  Hoeher  (see  [1]  for  references). 

In  this  paper  we  investigate  the  performance  of  single- 
stage  DFMD  with  convolutional  coding  and  soft-decision 
Viterbi  decoding  (SDD)  over  synchronous  CDMA  chan¬ 
nels  in  additive  white  Gaussian  noise  (AWGN).  Vary¬ 
ing  the  depth  of  trellis  truncation  of  the  Viterbi  de¬ 
coder  allows  control  over  the  reliability  of  decisions 
fed  back.  We  compare  the  performance  of  combined 
DFMD/SDD  with  conventional  and  LDD  with  coding 
using  simulation  and  analysis.  Performance  bounds 
are  derived  for  a  two-user  coded  synchronous  case. 

2  Combined  DFMD/SDD:  Model 

Consider  a  coded  asynchronous  CDMA  system  with 
K  users.  The  baseband  signal  of  the  A;th  user  is: 

CO 

Ci  y/r^Sk{t  -  iT) 
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where  cf  are  antipodal  channel  symbols  of  duration  T 
obtained  by  encoding  binary  information  symbols  of 
energy  Wk  with  a  rate  r  code  and  then  mapping  into 
{-hi,  —1}.  Sk{t)  is  the  normalized  signature  waveform 
of  the  kih  user,  Sk{t)  =  0  for  t  ^  [0,T].  Using  the  D 
transform,  the  input  symbol  stream  of  the  kih.  user  is 
written  as 

c’^iD)  =  £  cfD‘. 

»  =  — CO 

The  input  symbol  vector  of  K  users  is  given  by 
ciD)  =  {c\D),cHD),...,c^{D)f. 
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The  receiver  observes  the  sum  of  the  transmitted  sig¬ 
nals  in  additive  white  Gaussian  noise  (AWGN). 

K 
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where  r*  €  [0,r]  is  the  relative  delay  between  users 
and  n{t)  is  WGN  with  power  spectral  density  No/2, 
The  received  signal  is  filtered  by  a  bank  of  matched 
filters,  each  matched  to  the  corresponding  signature 
waveform.  The  sampled  output  of  the  matched  fil¬ 
ter  v{D)y  where  v(jD)  =  {v^{D)yV^{D)y . . ,  ,  (D))'^ 

is  given  by 

v{D)  =  S{D)Wc{D)  +  z{D) 

where  W  =  diag(yru;i,  y/rw2, . . . ,  y/rwK)  and  S{D) 
is  the  channel  spectrum 

S{D)  =  R^iD^^+Ro  +  RiD 

where  Sk{t—Tk)si{t-Ti+iT)dt,  z{D)  is  col¬ 

ored  Gaussian  noise  with  spectrum  Sz{D)  =  {Nq/2)S{D). 
S{D)  is  real-symmetric"and  non-singular  on  the  unit 
circle.  It  can  be  factored  as 

S{D)  =  F'^{D-^)FiD) 

F{D)  and  F{D)~^  are  both  causal  and  stable  matrix 
filters.  F{D)  has  the  following  form 

F{D)  =  Fo  +  FiD-h  F2D^  -h  •  •  •  (1) 

where  Fq  is  lower  triangular  (See  [4]  for  references). 
Note  that  the  definition  of  causality  implied  here  de¬ 
fines  the  “pasf^  for  each  component  A:  of  a  vector  pro¬ 
cess  at  time  i  as  given  by  all  components  at  times 
*-l,t-2,...and  the  components  1,  2,... ,  kA  at  time  2, 
and  the  ^presenf^  as  the  component  itself. 

The  single-stage  decision-feedback  multiuser  detec¬ 
tor  proposed  in  [2]  consists  of  an  anticausal  forward 
filter  0(0)  as  a  zero-forcing  filter  that  eliminates  mul¬ 
tiuser  interference  from  “fuiure^^  symbols  and  a  strictly 
causal  feedback  filter  B(D)  that  attempts  to  cancel 
multiuser  interference  from  ^^pasf^  symbols.  The  op¬ 
timum  forward  and  feedback  filters  that  would  maxi¬ 
mize  the  ideal  signal-to-noise  ratio  (SNR)  at  the  input 
to  the  decision  devices  are  shown  to  be [2] 

G{D)  =  {F^{D-^))-'^  ,  B{D)  =  {FiDydmg{Fo))W 

(2) 

Note  that  G{D)  is  the  whitening  filter  for  the  process 
v{D)  and  y(D)  is  the  resulting  whitened  process 

y{D)  =  G{D)v{D)  =  F{D)Wc{D)  +  n{D) 

where  n(D)  is  WGN. 

The  detector  proposed  in  [2]  orders  the  users  in  de¬ 
creasing  symbol  energy.  In  the  2-th  sampling  time, 
user  1  makes  its  decision  first  based  on  the  correspond¬ 
ing  output  of  the  forward  filter  (y/)  and  the  decisions 


Figure  1:  Combined  Decision- Feedback  Multiuser 
Detector/Soft-Decision  Decoder  (i=:Truncation  depth 
of  the  feedback  device). 


of  all  users  made  at  sampling  times  2- 1,2-2,. . .  .  User  2 
makes  its  decision  next  based  on  yf ,  the  decision  made 
by  user  1  for  the  2-th  sampling  time  and  the  past  de¬ 
cisions  of  all  users,  and  so  on. 

The  multiuser  receiver  structure  we  propose  is  shown 
in  Figure  1.  The  decision-feedback  mechanism  feeds 
back  tentative  decisions  for  convolutionally  encoded 
symbols  (c*(D),  k  =  1,2,...)  obtained  from  the  trellis 
of  corresponding  decoders  at  depth  X,  where  L  is  less 
than  or  equal  to  the  truncation  depth  of  the  Viterbi 
decoder.  Since  the  decision  of  user  2  at  the  2-th  sam¬ 
pling  time  depends  on  the  decision  of  user  1  at  the 
same  sampling  time,  user  2’s  decoder  runs  at  a  lag  of 
L  units  behind  the  decoder  of  user  1,  and  so  on  for 
other  decoders.  Thus  the  total  detection  delay  of  a  ii" 
user  system  is  KL, 

3  Performance  analysis 

3.1  Bounds:  two-user  synchronous  system 

The  input  to  the  Viterbi  decoders  in  Figure  1  is 

c{D)  =  G{D)v{D)  -  B{D)c{D), 
Substituting  G{D)  and  B{D)  from  (2),  we  get 

c{D)  =  di3ig{Fo)Wc{D)-\-n{D) 

+iF{D)  -  dmg{Fo))W{c{D)  -  c(D)).(3) 

Note  that  F(D)  =  Fq  for  a  synchronous  CDMA  sys¬ 
tem.  Thus  B(D)  is  lower-triangular  with  zero  diagonal 
elements.  For  a  two-user  system,  (3)  simplifies  to 

c\D)  =  F^^y^c\D)  +  n\D)  (4) 

c^{D)  =  FPy/F^c\D)d-n^{D) 

+2F^^,/?^Ac\D)  (5) 

where  Ac^{D)  =  _  Since  user  1  sees 

WGN  only,  an  upper  bound  on  its  bit  error  probability 
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can  be  found  [5]  by  substituting  Es  —  {F^yrwi  in 


Thus, 


a;=e“^«/^o 


.(6) 


where  d/  is  the  free  distance  of  the  code  and  A{x)  is 
obtained  as 

yl(a:)  =  ■^G(x,y,z) 

G(x,  y,  z)  is  the  transfer  function  of  the  code,  given  by 


A.  <  Y, 

d^ds 


(8) 


where  Ad  is  the  coefficient  of  the  term  in  the  expan¬ 
sion  of  A{x),  We  also  obtain  a  relatively  loose  closed- 
form  union  bound  for  (see  [6]  for  derivation) 


(9) 


(7) 
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where  gijk  is  the  number  of  paths  on  the  trellis  of 
length  k  with  Hamming  weight  z,  corresponding  to  in¬ 
formation  vectors  of  Hamming  weight  that  diverge 
from  the  all-zero  path  at  a  node  and  remerge  with  it. 

For  user  2,  we  assume  that  the  all-zero  codeword  is 
sent.  We  find  the  probability  that  a  path  having  Ham¬ 
ming  weight  d  that  diverges  from  the  all-zero  path  at 
a  node  and  then  remerges  with  it,  accumulates  higher 
-metric  over  the  unmerged  segment  or  equivalently  the 
decision  variable 

Y  z?  =  +Y^J  +  Y 


where 

Z  =  exp 


1  +  2Pci  sinh 
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Combining  (8)  and  (9),  we  get  a  tight  closed  form  up¬ 
per  bound,  which  we  call  a  compound  union  bound 


Pi.  <  ^Tix) 
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P2(d)-\Z'^\  (11) 


where  N  is  arbitrary. 


/=!  1=1  /=i 

is  less  than  zero^.  (Code  symbol  0  is  mapped  to  +1). 
np,  /  =  1,2,...  are  independent  Gaussian  random  vari¬ 
ables  with  mean  0  and  variance  No/2.  We  model 
Ac[,/  =  1,2,...,  decision  errors  of  user  1  fed  back 
to  user  2,  as  i.i.d.  random  variables  with  distribution 
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3.2  DFMD,  LDD  and  the  conventional  de¬ 
tector:  multiuser  synchronous  system 

We  consider  a  system  with  K  equal  energy  users. 
For  each  detector,  we  find  the  first-order  statistic  for 
the  /-th  code  symbol  of  the  k-th  user: 

DFMD  Assuming  that  cf  =  -hi  is  sent,  the  statistic 
is  given  by 


■  nf  -h  2\/rwIi 


where  is  the  probability  of  code  symbol  error  of 
user  1^  derived  in  [6].  Since  at  the  output  of  a  Viterbi 
decoder,  we  usually  get  errors  in  bursts,  Ac^  are  not  in¬ 
dependent.  But  we  can  assume  independence  if  we  as¬ 
sume  an  interleaver  which  randomizes  all  error  bursts 
at  the  output  of  decoder  1  before  feedback.  However, 
for  all  practical  purposes  an  interleaver  is  not  required. 

Using  the  above  model,  we  get  the  pairwise  error 
probability  as 


P2{d) 


Cl 


iti=i(:2=0,Jbi+)fc2<<i  ^  ^  ' 

y/d 
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^Note  that  the  superscripts  in  and  nf  identify  user  2. 
^Note  from  Figure  1  that  it’s  the  code  symbols  that  are  fed 
back  not  the  information  symbols. 


where  P**  <  1  V  fc  and  if  =  if  is 

the  residual  multiuser  interference  that  affects  the  /-th 
symbol  of  the  ik-th  user  and  comes  from  the  decision 
errors  of  the  lower-indexed  interfering  users. 

We  model  AcJ”,  m  =  1, 2, . . . ,  1  as  i.i.d.  random 

variables  with  distribution 


ip 

I -Pc 


where  P^  is  the  probability  of  code  symbol  error  of 
user  1.  We  note  that  if  a  decision-error  in  the  /-th 
symbol  of  user  i  forces  user  j  {j  >  i)  to  make  an  error 
when  it  is  fed  back  in  the  absence  of  noise,  Ac|  and 

AcJ  would  have  opposite  signs.  This  means  that  the 
correlation  among  Acf^ ,  m  =  1, . . . ,  —  1  is  such  that 

they  tend  to  cancel  each  other  out  when  added.  Thus 
our  assumption  about  residual  multiuser  interference 
(that  it  is  the  sum  of  independent  r.v.’s  AcJ”)  should 
be  pessimistic. 
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Using  this  model  for  Ac]^,  we  approximate  as  a 
Gaussian  random  variable  with  mean  zero  and  vari¬ 
ance  Pci  independent  of  the  noise 

component  nf  that  affects  user  k.  Letting  Df  =  \ 


we  get 

D\  =  ^/2r{w^^  +  Nk  (12) 

where  Nk  is  a  Gaussian  r.v.  with  mean  0  and  variance 


=  l  +  8r^A,  '(If) 

LDD  The  first-order  statistic  for  LDD,  normallized 
by  is  given  by  (12)  where  Nk  has  variance 

V*'(Ar»)  =  >  1 . 

(14) 

In  the  synchronous  case,  if  all  users  have  equal  cross- 
correlations,  then  =  ((-Fo)jbxJb)^^  V  k. 

The  Conventional  Detector  The^  matched  filter 
output  for  the  /-th  symbol  of  the  i-th"user  given  that 

a  H-l  is  sent  can  be  normallized  by  \/X  to  get  (12) 
where  Nk  has  variance 


4  Simulation  Results  Sz  Analysis 

We  simulated  a  two-user  synchronous  CDMA  sys¬ 
tem  with  rate  |,  memory  4,  convolutional  encoding 
and  Viterbi  decoding  with  8-level  quantization.  We 
employed  length  31  spreading  sequences  for  the  coded 
system  and  length  63  spreading  sequences  for  the  un¬ 
coded  system.  Note  that  for  a  synchronous  two-user 
system,  decision-feedback  multiuser  detection  involves 
zero-forcing  detection  for  user  1  and  complete  decision- 
feedback  interference  cancellation  for  user  2.  Thus  for 
user  1,  DFMD  is  the  same  as  LDD.  Bit-error  probabil¬ 
ity  of  user  2  with  DFMD  is  compared  to  that  with  con¬ 
ventional  and  LDD  in  Fig.  2.  Truncation  depths  L — 
4,8,16  and  32  are  used  for  the  feedback  mechanism  in 
the  coded  system.  With  all  truncation  depths  decision- 
feedback  detection  outperforms  linear- decorrelation  for 
the  normal  operating  range  of  the  system  (SNR1>SNR2). 
When  multiuser  interference  is  large  (SNR1>>SNR2), 

L  =  4  is  as  good  as  L  =  32,  and  both  achieve  the 
single-user  bound.  This  happens  because  the  tenta¬ 
tive  decisions  of  user  1  are  reliable  even  when  obtained 
from  the  trellis  at  small  depth,  owing  to  the  high  SNR 
of  the  interfering  user.  Figure  2  shows  how  the  near-far 
resistance  of  DFMD  with  coding  depends  on  L,  For 
the  code  used,  the  minimum  truncation  depth  {Lmin) 
which  does  not  result  in  appreciable  performance  loss 
due  to  truncation  is  known  to  be  15.  It  can  be  seen 


Simulation  results  for  various  multiuser  detectors 


SNR1  (SNR2  =  3.0dB) 


Figure  2:  Simulation  Results. 


that  for  L  >  Lmin,  the  effect  on  the  near-far  resis¬ 
tance  of  the  detector  is  negligible.  Thus  if  variable 
detection  delay  is  allowable,  weak  users  can  virtually 
achieve  single-user  performance  at  all  interference  con¬ 
ditions  by  adjusting  L  according  to  the  levels  of  mul¬ 
tiuser  interference. 

Performance  bounds  are  shown  in  Fig.  3.  By  com¬ 
bining  the  loose  union  bound  of  (9)  with  an  arbitrary 
number  of  exact  union  bound  terms,  we  get  a  tighter 
upper  bound.  For  finite  truncation  L,  the  bounds  di¬ 
verge  for  SNR1<4.0  dB. 

In  Table  1,  we  compare  the  variance  of  the  first- 
order  statistic  for  the  various  detectors  as  given  by  (13), 
(14)  and  (15).  The  system  considered  is  a  coded,  syn¬ 
chronous  CDMA  system  with  K  equal  energy  users. 
The  computations  are  done  assuming  that  all  users  em¬ 
ploy  length  63  Gold  sequences  that  have  equal  cross¬ 
correlations.  Pci  is  determined  assuming  rate  mem¬ 
ory  4,  convolutional  coding.  For  the  conventional  de¬ 
tector,  this  corresponds  to  a  strictly  power-controlled 
system.  However,  the  variance  of  the  first-order  statis¬ 
tic  for  this  detector  blows  up  with  the  number  of  users. 
The  gain  obtained  by  LDD  over  the  conventional  de¬ 
tector  goes  from  1.2  dB  when  A’  =  4,  to  5.3  dB  when 
A  =  20.  The  performance  of  users  with  decision- 
feedback  detection  improves  with  their  order.  For  user 
1,  the  variance  of  the  statistic  is  the  same  as  that  with 
LDD  because  its  detection  strategy  is  the  same  with 
both  detectors.  For  the  A-th  user,  the  variance  of  the 
statistic  is  the  lowest  because  the  detector  attempts  to 
cancel  multiuser  interference  from  all  interfering  users 
and  does  not  employ  any  zero-forcing.  The  highest 
order  user  gains  0.65  dB  over  linear- decorrelator  in  a 
4-user  system.  We  observe  that  when  the  number  of 
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Figure  3:  Performance  bounds  for  combined 

DFMD/SDD  (SNR2  =  3.0  dB). 


1  User  order  |  Conventional 

LDD 

DFMD 

1  Four  equal  energy  users:  | 

1 

1:555 

1.166 

1.166 

2 

1.555 

1.166 

1.131 

3 

1.555 

1.166 

1.079 

4 

1.555 

1.166 

1.002 

1  Twenty  equal  energy  users:  | 

1 

4.476 

1.308 

1.308 

2 

4.476 

1.308 

1.305 

3 

4.476 

1.308 

1.301 

4 

4.476 

1.308 

1.297 

5 

4.476 

1.308 

1.293 

6 

4.476 

1.308 

1.288 

7 

4.476 

1.308 

1.283 

8 

4.476 

1.308 

1.277 

9 

4.476 

1.308 

1.271 

10 

4.476 

1.308 

1.264 

11 

4.476 

1.308 

1.257 

12 

4.476 

1.308 

1.249 

13 

4.476 

1.308 

1.240 

14 

4.476 

1.308 

1.229 

15 

4.476 

1.308 

1.214 

16 

4.476 

1.308 

1.194 

17 

4.476 

1.308 

1.170 

18 

4.476 

1.308 

1.142 

19 

4.476 

1.308 

1.108 

20 

4.476 

1.308 

1.044 

users  is  large,  a  fraction  of  low-order  users  do  not  gain 
much  with  decision-feedback  detection  over  complete 
linear-decorrelation  (See  Table  1).  Thus  it  may  be 
practical  to  employ  complete  linear- decorrelation  for 
the  first  few  lowest-order  users  and  decision-feedback 
detection  for  the  rest  of  them,  thereby  reducing  overall 
detection  delay, 

5  Conclusions 

Combined  decision-feedback  multiuser  detection  and 
soft-decision  Viterbi  decoding  is  proposed  for  asyn¬ 
chronous  CDMA  channels  with  convolutional  coding. 
Simulation  and  analytical  results  are  obtained  for  the 
synchronous  case.  However,  they  also  provide  valuable 
insight  into  the  more  general  asynchronous  case.  The 
scheme  combines  all  the  benefits  of  decision-feedback 
interference  cancellation  with  the  gains  of  soft- decision 
decoding.  It  allows  us  flexibility  to  control  the  relia¬ 
bility  of  decisions  fed  back  and  we  get  a  tradeoff  be¬ 
tween  overall  delay  of  the  receiver  and  performance  at 
medium  or  low  levels  of  interference.  At  high  levels 
of  multiuser  interference,  weak  users  can  achieve  the 
single  user  bound  even  with  low  truncation  lengths  of 
the  feedback  mechanism  because  the  decisions  made 
by  the  stronger  interferers  are  reliable.  If  variable 
detection  delay  is  allowable,  then  weak  users  can  be 
made  to  approach  single-user  bound  for  all  levels  of 
interference.  The  zero-forcing  forward  filter  limits  the 
performance  gain  achievable  by  the  strong  users  in  a 
synchronous  CDMA  system. 
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Table  1:  Variance  of  first-order  statistic  for  various 
multiuser  detectors  {No  =  1). 
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Abstract:  In  this  paper,  we  consider  several  multiuser 
demodulation  schemes  for  a  direct-sequence  code-division 
multiple  access  (DS-CDMA)  system  that  provide  soft- 
outputs  to  enhance  single-user  soft-decision  decoders. 
First,  we  derive  an  optimum  soft-output  multiuser  estima¬ 
tion  (OSOME)  algorithm  for  asynchronous  CDMA  chan¬ 
nels  with  white  Gaussian  noise  (WGN).  The  algorithm  gen¬ 
erates  a  posteriori  probabilities  (APP)  for  coded  bits  given 
the  received  signal.  This  forward-recursive  algorithm  re¬ 
quires  complexity  and  storage  exponential  in  the  number 
of  users  K  and  linear  in  the  decision  lag  L.  Second,  we  out¬ 
line  a  reduced-computation  suboptimal  version  of  this  al¬ 
gorithm  (SSOME)  which  compares  favorably  in  complexity 
and  storage  with  multiuser  soft-output  Viterbi  algorithm 
(SOVA).  Furthermore,  we  note  that  the  Bayesian  condi¬ 
tional  decision  feedback  estimation  (BCDFE)  algorithm  [3] 
provides  symbol  APP  estimates  that  can  be  considerably 
improved  upon  by  a  slight  modification.  We  derive  the 
modified  BCDFE  algorithm  for  whitened  multiuser  chan¬ 
nels.  Finally,  we  present  simulation  results  of  a  4-user, 
convolutionally  coded,  asynchronous  CDMA  system.  The 
results  indicate  that  the  SSOME  algorithm  with  L  K  —  1 
achieves  a  0.4  -  0.5  dB  gain  over  SOVA  with  much  greater 
lag  (and  complexity)  and  keeps  within  0.25  dB  of  the 
OSOME  algorithm.  Moreover,  the  modified  BCDFE  algo¬ 
rithm  provides  a  good  performance/complexity  tradeoff. 

1  Introduction 

Multiuser  CDMA  systems  suffer  from  multiple-access  in¬ 
terference  (MAI),  noise  and  other  channel  impairments. 
Two  techniques  are  generally  used  to  combat  MAI.  The 
first  is  multiuser  demodulation  for  which  there  has  been 
considerable  research.  The  second  technique  is  the  com¬ 
bination  of  coding  and  interleaving.  In  order  to  make  the 
most  of  error  control  coding,  the  demodulator  must  pro¬ 
vide  soft-inputs  to  the  decoders  in  the  form  of  likelihoods, 
symbol  APP’s,  erasures,  etc..  Unfortunately  most  mul¬ 
tiuser  detection  algorithms  ignore  the  possibility  of  gener¬ 
ating  this  reliability  information  and  concentrate  instead 
on  minimizing  the  demodulated  error  rate.  In  this  paper, 
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imder  grant  DAAH04-95-I-0246  €ind  by  Ericsson  Inc. 


we  attempt  to  derive  multiuser  algorithms  that  estimate 
code  symbol  APP’s  and  can  be  effectively  combined  with 
error  control  coding  and  interleaving. 

Recently  considerable  amount  of  work  has  been  done  in 
symbol-by-symbol  detection  techniques  for  channels  with 
ISI  ([1]  and  [3]).  The  optimum  soft-output  algorithm 
(OSA)  of  [1]  can  be  applied  to  the  multiuser  case,  but  this 
requires  noise  whitening.  Despite  the  strides  made  in  the 
whitening  technique  [7],  it  requires  sizable  complexity  and 
is  not  very  suitable  for  long  spreading  sequences  and  time 
varying  channels.  Hayes  et.  al.  derived  an  optimal  symbol- 
by-symbol  detection  (OSSD)  algorithm  for  ISI  channels 
with  correlated  noise  (after  matched  filtering)  [2].  They 
and  other  authors,  however,  over-estimated  the  complex¬ 
ity  of  this  algorithm.  We  derive  an  optimum  soft-output 
multiuser  estimation  (OSOME)  algorithm  based  on  their 
scheme.  The  complexity  of  this  algorithm  is  at  par  with 
the  OSA  but  does  not  require  noise  whitening.  The  OS¬ 
OME  algorithm  is  an  optimum  demodulator  for  a  coded 
multiuser  CDMA  system  with  ideal  interleaving,  in  the 
sense  that  without  exploiting  any  information  about  cod¬ 
ing  in  the  demodulation  process,  it  supplies  each  individual 
decoder  with  as  much  information  as  possible  about  the 
sequence  of  modulator-input  symbols  for  the  correspond¬ 
ing  user  while  suppressing  the  irrelevant  information  about 
other  user’s  sequences.  This  is  called  user-separating  de¬ 
modulation  in  [8]. 

The  recursion  in  the  OSOME  algorithm  can  be  reduced 
to  add-compare-select  operations  only.  The  resulting  sub- 
optimal  version  called  SSOME  has  a  recursion  similar  to 
the  sub-optimum  soft-output  algorithm  (SSA)  of  [1]  but 
differs  in  the  way  symbol  APP’s  are  generated. 

Bayesian  Conditional  Decision  Feedback  Estimation 
(BCDFE)  proposed  in  [3]  for  ISI  channels  is  a  reduced- 
complexity  symbol-by-symbol  estimation  scheme  that  em¬ 
ploys  an  efficient  method  of  trellis  truncation.  While  the 
scheme  is  capable  of  providing  good  hard  decisions  on  sym¬ 
bols,  it  fails  to  deliver  good  estimates  of  symbol  APP’s. 
Fortunately,  this  can  be  fixed  by  a  simple  modification. 
The  modified  BCDFE  algorithm  is  considered  for  mul¬ 
tiuser  estimation.  This  algorithm  requires  noise  whitening. 

Simulation  is  undertaken  to  compare  the  performance  of 
these  algorithms  with  SOVA  [4].  In  addition,  the  complex¬ 
ity  of  these  algorithms  are  compared. 


Corollary  12.2.  Any  coset  of  an  m-sequence  code  or  extended  ui-sequence  code 
is  an  MWBE  signal  set. 

Also,  as  noted  above,  the  entire  Gold  code  is  a  WBE  signal  set.  Constructions 
similar  to  the  one  in  Proposition  12  can  also  be  applied  to  various  other  classes 
of  codes  described  in  [24]  to  obtain  WBE  signal  sets. 

2.6  Binary  Very  Nearly  MWBE  Signal  Sets 

Cosets  of  the  m-sequence  codes  and  extended  m-sequence  codes  are  MWBE 
signal  sets,  but  supercodes  of  these  codes  generally  are  not  MWBE  signal  sets. 
In  two  interesting  cases,  however,  certain  supercodes  of  binary  MWBE  signal  sets 
are  very  nearly  MWBE  signal  sets  in  the  sense  that  /max  =  VN  and  equality 
very  nearly  holds  in  (5). 

The  extended  small  Kasami  codes  are  (2^”,  3n)  extended  binary  cyclic  codes 

whose  parity-check  polynomials  are  ofthe  form  Me  (K)MY(a:)  where  a  €  G/^(2^”) 

and  7  G  Gf  (2”)  are  primitive  elements  and  7  =  (cf.  Chapter  8.7  of  [13]). 

The  inner  products  take' on  values  ±2”  =  ±VN  and  0  only.  Thus,  equality  very 
nearly  holds  in  (5)  for  these  WBE  signal  sets.  One  could  even  say  that  these 
small  Kasami  signal  sets  are  asymptotically  MWBE  signal  sets  in  the  sense 
that  the  ratio  of  the  maximum  inner  product  magnitude  to  the  right  side  of  (5) 
approaches  1  as  N  —y  00. 

The  (2^",4n)  binary  nonlinear  Kerdock  code  (cf.  Chapter  15  of  [13])  is  a 
union  of  cosets  of  the  extended  m-sequence  code  and  is  thus  a  WBE  signal 
set.  Unfortunately,  the  complement  of  each  codeword  is  also  a  codeword,  and 
the  inner  product  of  the  signals  corresponding  to  these  two  codewords  is  —A. 
The  inner  product  of  the  signals  corresponding  to  any  two  noncomplementary 
codewords  takes  on  values  ±2”  =  ±'v/A  and  0.  It  follows  that  the  (2^”,4ti  —  1) 
subcode  obtained  by  choosing  either  a  codeword  or  its  complement  from  the 
Kerdock  code  gives  a  signal  set  of  size  N^/2  and  length  N  with  /max  =  2”  = 
y/W  and  these  Kerdock  signal  sets  are  asymptotically  MWBE  signal  sets.  The 
Kerdock  signal  sets  are  also  optimum  in  the  sense  of  being  the  largest  possible 
binary  signal  sets  for  which  /max  =  VN.  This  was  noted  by  Levenshtein  [11]  who 
showed  that  if  /max  <  VN,  then  K  <  N^/2  for  binary  signal  sets.  In  contrast 
to  the  Kerdock  signal  sets,  the  extended  Gold  codes  of  the  same  length  have 
signals  and  thus  are  twice  the  size.  However,  these  codes  have  /max  =  2-y/A 
which  is  also  twice  as  large.  In  fact,  the  (22",  5n)  extended  large  Kasami  codes 
give  an  even  larger  signal  set  with  signals  for  which  /max  =  2\/A  also  [24]. 

The  signal  sets  described  above  have  lengths  2^".  The  Kasami  construction 
cannot  be  used  for  lengths  22"+i,  while  the  Kerdock  codes  of  length  A  =  22"+^ 
are  linear  and  have  the  same  parameters  as  the  extended  Gold  codes,  viz.  K  = 
A2  and  /max  =  Of  course,  since  -/A  =  2(2"+U/2  jg  not  an  integer,  there 

are  no  nearly  MWBE  signal  sets  for  these  lengths.  However,  since  Sidelnikov  [26] 
has  shown  that  for  binary  signal  sets  of  size  N^,  /max  >  \/2A  —  2,  the  Kerdock 
and  Gold  code  sets  are  optimal  with  respect  to  this  bound  [24]. 


2.7  WBE  Signal  Sets  over  Prime  Power  Alphabets 

Many  of  the  results  stated  above  are  also  applicable  to  codes  over  prime-power 
fields  GF{p‘),  but  often  do  not  produce  any  new  classes  of  signal  sets.  This 
is  because  the  results  hold  only  when  the  elements  of  GF{p’)  are  mapped  to 
the  complex  p-th  roots  of  unity  (and  not  to  the  p*-th  roots  of  unity)  via  x 
exp(j27rTr(x)/p)  where  Tr(-)  denotes  the  trace  function  from  GF{p‘)  to  GF(p). 

In  practical  applications,  however,  there  is  often  the  need  to  design  signals 
over  prime-power  alphabets  such  as  the  quaternary  alphabet  {-|-1,  — 1,  — i}- 

Unfortunately,  codes  over  GF{2'^),  when  used  as  described  above,  yield  binary 
signals  and  not  quaternary  signals.  One  solution  to  this  difficulty  is  to  use  codes 
over  Z4  as  described  in  Proposition  7.  Codes  over  Z4  have  attracted  a  great 
deal  of  research  interest  in  recent  years  [6], [8],  and  many  new  results  are  avail¬ 
able.  A  somewhat  different  solution  is  described  below  wherein  signals  over  a 
^-ary  alphabet  are  constructed  from  short  codes  over  GF{q).  Interestingly,  this 
construction  allows  the  use  of  an  arbitrary  one-one  mapping  from  GF{q)  to  the 
complex  ^-th  roots  of  unity. 

Proposition  13.  Let  C  denote  a  linear  {N,  k)  code  over  the  field  GF{q)  with  the 
property  that  no  code  symbol  Cm  is  a  nonzero  multiple  of  another  code  symbol 
C„.  Let  S{C)  be  a  signal  matrix  obtained  by  mapping  the  codeword  matrix  S{C) 
to  the  complex  q-th  roots  of  unity  via  an  arbitrary  one-one  mapping.  Then  S{C) 
is  a  WBE  signal  set  of  size  K  =  9*  and  length  N. 

Thus,  one  can  use  codes  over  GF{2^)  to  construct  WBE  signal  sets  over  the 
quaternary  alphabet  just  as  one  can  use  codes  over  Z4.  However,  although  both 
types  of  signal  sets  have  the  WBE  property,  codes  over  Z4  often  can  be  designed 
to  have  smaller  /max  than  the  codes  over  GF(2^).  Finally,  note  that  N  cannot 
exceed  {q'^  —  l)/{q  —  1)  if  this  construction  is  used. 

2.8  WBE  Signal  Sets  of  Arbitrary  Size 

WBE  signal  sets  of  obtained  from  codes  are  of  size  size  q^  (or  L  •  q’’)  where 
9*  >  N.  It  is  thus  of  interest  to  consider  whether  a  WBE  signal  set  of  size  K 
and  length  N  exists  for  any  given  K  >  N  >  1.  The  answer  is  trivially  Yes  if  one 
uses  a  (AT,  1)  code  over  Zk-  A  more  interesting  construction  is  as  follows. 

Proposition  14.  Given  any  N  >  I  and  any  K  =  ^  there 

exists  a  WBE  signal  set  of  size  K  and  length  N  over  the  complex  r-th  roots  of 
unity  where  r  is  a  divisor  of  piP2  •  •  ‘Pn- 

The  idea  here  is  to  apply  Proposition  5  to  the  (pf  S  fe,)  extended  cyclic  codes 
to  construct  a  WBE  signal  set  of  size  K  and  length  K  and  then  shorten  it  in 
accordance  with  Proposition  4.  Thus,  given  any  N  >  I,  one  can  construct  a 
WBE  signal  set  of  size  K  and  length  N  for  each  K  >  N,  Note,  however,  that 
since  K  and  K  1  are  relatively  prime,  the  WBE  signal  sets  of  these  sizes 
provided  by  Proposition  14  are  over  completely  different  alphabets.  Of  course,  a 


little  thought  about  the  condition  for  equality  in  Proposition  1  shows  that  it  is 
not  possible  to  augment  any  WBE  signal  set  by  just  one  signal  and  still  retain 
the  WBE  property. 

Just  as  it  is  not  possible  to  add  one  signal  to  a  WBE  signal  set  and  still  retain 
the  WBE  property,  it  is  not  possible  to  delete  one  signal  from  a  WBE  signal 
set  and  still  retain  the  WBE  property.  More  generally,  while  there  might  exist 
subsets  of  WBE  signal  sets  that  are  themselves  WBE  signal  sets  (for  example, 
when  a  WBE  signal  set  is  the  union  of  two  WBE  signal  sets),  not  all  subsets  of 
a  WBE  signal  set  are  WBE  signal  sets. 

Proposition  15.  Let  X  denote  a  WBE  signal  set  of  size  K  and  length  N.  Then, 
for  each  L  <C  K,  at  least  one  subset  y  C.  X  of  size  L  is  not  a  WBE  signal  set. 
However,  if  y  is  a  WBE  signal  set,  then  so  is  its  complement  X  —  y. 

In  fact,  for  at  least  one  subset  y, 

Ems  (3^)>  ■^rms 

and  thus  this  y  is  not  a  WBE  signal  set.  Indeed,  for  most  values  of  L,  no  subset 
of  size  E  is  a  WBE  signal  set. 


2.9  Random  Signal  Sets 

The  standard  model  for  a  randomly  chosen  signal  set  X  over  a  p-ary  alpha¬ 
bet  is  that  the  entries  in  the  K  x  N  signal  matrix  M{X)  are  KN  independent 
identically  distributed  complex-valued  discrete  random  variables  uniformly  dis¬ 
tributed  on  the  complex  p-th  roots  of  unity.  It  is  easily  verified  that  the  signals 
have  energy  N  and  that  for  k^l,  =  N.  Thus,  for  a  randomly 

chosen  signal  set  A', 


E 


K  K 

EEi(« 

U=i  i=i 
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=  K{K-  l)N  -h  KN^  >  K^N 


so  that  the  average  randomly  chosen  set  of  signals  is  not  a  WBE  signal  set. 
Note  that  £'[(/rms(-1^))^]  =  ^  which  is  slightly  larger  than  the  minimum  value 
N{K  —  N)/{K  —  1)  given  by  the  Welch  bound  (4). 

The  standard  model  for  randomly  chosen  signal  sets  is  convenient  because  the 
independence  of  the  symbols  greatly  simplifies  system  analyses.  An  equivalent 
model  is  that  the  K  signals  are  a  random  sample  with  replacement  from  the 
{N,  N)  code  over  GF(p)  described  in  Corollary  7.4.  Sampling  with  replacement 
allows  for  the  possibility  that  the  same  signal  might  be  chosen  more  than  once 
and  thus  Ems  {X)  -  VN.  On  the  other  hand,  a  more  appropriate  model  might  be 
to  construct  X  by  random  sampling  without  replacement  so  that  all  the  signals 
in  X  are  guaranteed  to  be  distinct.  Since  the  {N,N)  code  is  a  WBE  signal  set. 
Proposition  15  shows  that 

E^[(/rms(-V))"]  =  =  arms(C))='  > 


where  C  denotes  the  {N,  N)  code.  Although  the  average  random  subset  X  is  not 
a  WBE  signal  set  under  this  interpretation  either,  it  is  worth  noting  that  the 
RMS  inner  product  magnitude  for  randomly  chosen  signals  is  smaller  than  y/N 
under  this  interpretation. 

2.10  Information  Theory  and  WBE  Signal  Sets 

The  most  important  point  to  keep  in  mind  in  the  analysis  of  communication 
systems  that  use  WBE  signal  sets  is  that  the  various  properties  of  WBE  signal 
sets  do  not  always  apply  to  subsets.  In  particular,  subsets  of  WBE  signal  sets  are 
not  necessarily  WBE  signal  sets.  Consider  then,  the  result  in  [14]  that  if  a  WBE 
signal  set  is  used,  then  the  mean-square  interference  in  a  correlation  receiver  for 
the  /-th  transmitted  signal  is  the  same  for  all  values  of  / ,  that  is, 

=  KN-N\  1  <  /  <  K.  (7) 

kjii 

Now  suppose  that  only  L  <  K  transmitters  are  active.  Since  some  signals  are 
no  longer  present  at  the  receiver  input,  the  mean-square  interference  in  all  the  L 
active  receivers  is  reduced,  and  is  smaller  than  KN  .  Thus,  absence  of  some 
signals  improves  the  performance  of  the  rest.  However,  unless  the  active  subset 
happens  to  be  a  W^BE  signal  set  also,  the  mean-square  interference  in  all  the 
receivers  will  not  be  unt/orm/y  reduced  to  LN-N^.  Indeed,  it  is  easy  to  show  that 
for  all  choices  oil  and  L,  the  average  mean-square  interference  in  the  1-th  receiver 
when  L—1  other  transmitters  are  active  is  LN — +N {N —\){K — L) / {K > 
and  thus  for  at  least  one  choice  of  A  —  1  other  transmitters,  the  mean- 
square  interference  in  the  /-th  receiver  is  worse  than  that  promised  by  the  WBE 
signal  theory.  In  short,  one  cannot  simply  replace  K  with  L  in  (7)  when  only 
L  transmitters  are  active.  The  WBE  property  is  quite  fragile  and  does  not  hold 
for  subsets  or  for  variations  from  the  stated  conditions. 

A  fixed  assignment  of  a  WBE  signal  set  to  transmitters  does  not  retain 
the  WBE  property  when  some  transmitters  are  silent.  Thus,  suppose  that  K 
transmitters  use  a  WBE  signal  set  and  can  simultaneously  transmit  at  rate  R  so 
that  the  symmetric  rate  vector  (/?,  R, . . .  ,  R)  is  achieved.  If  the  first  transmitter 
is  silent,  the  rest  can  still  transmit  via  their  assigned  signals  from  the  WBE 
signal  set  at  rate  R  so  that  rate  vector  {0,R,R,...  ,R)  can  be  achieved.  In  fact, 
since  the  interference  is  less,  higher  rates  should  be  achievable,  but  without  a 
detailed  study  of  the  WBE  signal  set,  it  is  not  clear  what  symmetric  rate 
vector  (0,  R',  R',...  ,,  R')  (where  R'  >  R)  can  be  achieved.  Indeed,  the  entire 
rate  region  achievable  by  a  fixed  assignment  of  a  WBE  signal  set  (that  is,  the  set 
of  rate  vectors  (Ri,  R2,  •  •  •  ,  Rk)  at  which  it  is  possible  to  transmit  via  use  of  the 
WBE  signal  set)  seems  difficult  to  compute  without  detailed  knowledge  of  the 
structure  of  the  WBE  signal  set.  A  variable  assignment  of  signals  to  transmitters 
(depending  on  which  transmitters  have  data  to  send  at  any  given  time)  does  not 
work  either  because  there  may  not  be  a  WBE  signal  set  of  the  right  size.  For 
example,  the  size  of  a  binary  WBE  signal  set  is  either  2  or  a  multiple  of  4  which 
is  not  helpful  if  exactly  13  transmitters  wish  to  be  active. 


In  studying  the  capacity  region  of  multiple- access  channels,  one  might  wish 
to  use  different  WBE  signal  sets  to  achieve  different  rate  vectors.  For  any  given 
K  >  N,  Proposition  14  can  be  used  to  construct  WBE  signal  sets  of  sizes 

K,K  -  1,/^  -  2, _ Thus,  no  matter  which  L  <  K  transmitters  are  active, 

one  could  assign  a  WBE  signal  set  of  size  L  to  the  active  transmitters,  and 
each  active  receiver  would  uniformly  suffer  a  mean-square  multiple-access  inter¬ 
ference  of  LN  -  N^.  Unfortunately,  these  various  signal  sets  are  over  different 
alphabets,  so  that  the  achievable  rates  are  not  directly  comparable  since  they 
apply  to  different  multiple- access  channels.  On  the  other  hand,  if  the  different 
alphabets  are  combined  into  a  single  alphabet,  then  this  large  alphabet  has  size 
P1P2  •  •  -Ps  where  pi,p2>  •  •  •  )Ps  are  all  the  primes  that  do  not  exceed  K.  In  the 
absence  of  multi-user  interference  and  channel  noise,  any  transmitter  can  send 
log2PiP2  •  •  ‘P5  bits  per  symbol  over  the  channel,  so  the  capacity  is  quite  large. 
On  the  other  hand,  only  a  small  subset  of  the  alphabet  is  used  by  the  WBE 
signal  sets  described  above,  so  that  the  rate  vectors  that  can  be  achieved  by  the 
use  of  WBE  signal  sets  may  well  turn  out  to  be  a  very  small  subset  of  the  actual 
capacity  region. 

In  summary,  an  insistence  on  always  using  a^VBE  signal  set  may  be  a  hin¬ 
drance  rather  than  an  advantage,  and  the  simplifications  in  the  analysis  due  to 
the  uniform  interference  level  might  be  insufficient  to  overcome  the  disadvan¬ 
tages.  Subsets  of  WBE  signal  sets  do  not  enjoy  the  WBE  properties  and  results 
on  WBE  signal  sets  are  not  always  applicable  to  the  subsets. 

3  Bounds  on  Correlation  Functions 

The  bounds  on  inner  products  discussed  in  the  previous  section  are  of  interest 
in  studies  of  synchronous  CDMA  communication  systems.  Analyses  of  asyn¬ 
chronous  CDMA  communication  systems  lead  to  the  consideration  of  various 
correlation  functions  of  the  signals  (see,  for  example,  [24]),  and  bounds  on  the 
magnitudes  of  these  are  studied  in  this  section. 


3.1  The  Correlation  Functions 

As  before,  let  X  denote  a  set  of  K  signals  of  length  N  and  energy  N,  and 
let  S{^^)  denote  the  signal  matrix.  Now,  however,  it  is  not  necessary  to  insist 
that  K  >  N,  For  signals  E  the  aperiodic  crosscorrelation  function 

for  and  x^^^  (more  simply,  Ck,i{Tn))  is  defined  as 


CAr,/(m)  =  C^(fc)^^(o(m)  =  < 


(  iV-l-m 

n=:0 

JV— 1+m 

lo,  |m|  >  A^. 


(8) 


For  0  <  m  <  N,  the  periodic  crosscorrelation  function  —  9k,i(m) 

and  the  odd  crosscorrelation  function  =  Bk,i(m)  are  given  by 

dfc,/(»n)  =  0/c,i(m)  +  Ck,i(m  —  N)  and  =  Ck,i{m)  —  Ck,i{rn  —  N),  (9) 

and  have  been  studied  extensively  in  the  context  of  asynchronous  CDMA  systems 
using  biphase  modulation.  Consideration  of  Q-ary  modulation  in  CDMA  systems 
leads  to  the  study  of  the  polyphase  crosscorrelation  function  [4],  [28]  which  is 
defined  as 

where  7  is  a  unit-magnitude  complex  number.  Note  also  that  9^i^J  (0)  = 
for  all  7.  If  ife  =  /,  we  get  the  autocorrelation  functions  which  are  usually  denoted 
by  C'fc(m),  9k{m),  Bkim),  and  respectively.  All  the  autocorrelation  func¬ 

tions  have  the  same  value  for  zero  offset:  ^*(0)  =  dfc(O)  =  (^^(O)  =  ^^^(0)  = 
=  JV. 

Many  results  in  this  section  apply  to  all  of  the  correlation  functions  described 
above.  Let  5(-)  denote  any  of  the  correlation  functions  and  let  the  index  set  M 
be  defined  as 

r  {m  :  1  —  N<m<N  —  1},  if  </(•)  = 

^  \  {m  :  0  <  m  <  AT  -  1},  if  <7(-)  =  dW(-).  ^(O.  ^(O- 

Let  M*  =  M  -  {0},  and  M  =  |A<|  denote  the  cardinality  of  M.  The  maxi¬ 
mum  crosscorrelation  value  5c,max(-T)  and  maximum  out-of-phase  autocorrela¬ 
tion  value  5a,max(‘T)  are  defined  to  be 

=  max  m^  \gk,iH\  and  =  max^rn^.  \gk{m)\ 

The  maximum  correlation  value  is  gmaxi^)  —  rn^x{5fc,max  {X),ga  ,max  {X)}.  The 
RMS  correlation  value  is  obtained  by  averaging  all  K{K  -  1)M  +  K{M  -  1)  - 
K{KM  —  1)  correlation  magnitudes  and  is  thus 

=  —  I  - ^  [K{K  ““  l)M(^c,rms(^))^  +  K{M  —  l)(^a,rms(^))^] 

K{KM  -  1) 

where  ^c,rms(^)  is  the  the  RMS  crosscorrelation  value  obtained  by  averaging 
over  the  K{K  —  1)M  crosscorrelation  magnitudes,  and  ga, rms  {X)  is  the  RMS 
autocorrelation  value  obtained  by  averaging  over  the  K{M  -  1)  out-of-phase 
autocorrelation  magnitudes.  Note  also  that  (?c,max  ^  fl'c.rms)  ga, max  ^  ga, rms>  s^nd 

5^max  ^  5^rms‘ 


Here,  puncturing  means  deletion  of  symbols,  that  is,  columns  of  the  signal  ma¬ 
trix,  of  the  extended  m-sequence  code  with  k  information  symbols.  Next,  con¬ 
sider  cyclic  codes  of  length  N  and  extended  cyclic  codes  of  length  iV  +  1.  If 
the  parity-check  polynomial  h{z)  is  of  period  Ny  then  the  residues  of  mod 
h{z)y  0  <  i  <  N  -  1  a,re  distinct,  and  hence  each  code  symbol  is  a  different 
linear  function  of  the  information  symbols.  Thus  we  have  the  following  result. 

Proposition  11.  Suppose  that  N  is  not  a  multiple  of  p.  Then, 

-  A  cyclic  code  of  length  N  over  GF{p)  is  a  WBE  signal  set  if  and  only  if  its 
parity- check  polynomial  h[z)  has  period  N , 

—  An  extended  cyclic  code  of  length  -|-  1  over  GF(^p)  is  a  WBE  signal  set  if 
and  only  if  its  parity-check  polynomial  h{z)  has  period  N, 

One  way  of  ensuring  that  h(z)  has  period  N  is  to  require  that  h[z)  be  divisible 
by  an  irreducible  polynomial  of  period  N. 

Corollary  11.1.  If  N  is  not  a  multiple  of  p,  then  cyclic  codes  of  length  N  and 
extended  cyclic  codes  of  length  JV  -{-  1  over  GF{p)  are  WBE  signal  sets  if  the 
parity-check  polynomial  h{z)  is  divisible  by  an  irreducible  polynomial  of  period 
N  over  GF{p). 

As  an  example,  the  binary  Gold  codes  are  (2”  —  l,2n)  cyclic  codes  with  parity- 
check  polynomials  Moi{x)Mf3{x)  where  Moc{x)  and  Mp{x)  denote  the  minimal 
polynomials  of  primitive  elements  o,  ^  G  GF{2^)  and  the  Tu-sequences  u  and 
V  generated  by  Ma(x)  and  are  a  preferred  pair  of  m-sequences  [24].  The 

representatives  of  the  cyclic  equivalence  classes  of  the  codewords  in  this  code  can 
be  expressed  as  {v,  u,u-{‘  v,u-{‘Tv,u-\- T^v, ...  ,  u -|- where  T  is  the  left 
cyclic  shift  operator  and  N  =  and  constitute  the  well-known  class  of  Gold 

sequences  [24].  According  to  Corollary  11.1,  Gold  codes  are  WBE  signal  sets,  as 
indeed  are  all  supercodes  of  the  ?n-sequence  codes  and  the  extended  m-sequence 
codes.  In  fact,  the  extended  (2”,2n)  Gold  codes  and  Gold-like  codes  [24]  are 
WBE  signal  sets  of  length  AT  =  2”  and  size  =  2^”  for  which  /max  = 
which  equals  2y/N  or  according  as  n  is  even  or  odd. 

2.5  Some  Interesting  Cosets  of  Cyclic  Codes 

Massey  and  Mittelholzer  [14]  noted  that  the  Gold  sequence  set  described  above 
is  very  nearly  a  coset  of  an  m-sequence  code. 

Proposition  12.  (Massey  and  Mittelholzer  [14]):  The  expurgated  set  of  Gold 
sequences  {u,  u-^  v,u-\-  Tv,  u  -j-  ...  ,  u  -h  T^  is  a  WBE  signal  set. 

In  fact,  from  Proposition  9  and  Corollary  7.2,  it  follows  that 

Corollary  12.1.  The  expurgated  set  of  Gold  sequences  is  an  MWBE  signal  set. 

Of  course,  as  noted  in  [14],  any  coset  of  an  m-sequence  code  is  an  WBE  signal 
set.  More  strongly,  it  follows  from  Corollary  7.2  that 


2.4  Cyclic  and  Extended  Cyclic  Codes 

Let  h(z)  denote  an  irreducible  polynomial  of  degree  k  and  period  N  over  GF{p). 
Then,  the  codewords  of  the  (iV,  k)  cyclic  code  with  parity-check  polynomial  h{z) 
can  be  represented  as 

=  (Tr(a“  •  C),  •  0.  •  0.  •  •  •  .  Tr(a^-i  •  C))  (6) 

where  is  a  root  of  h{z).  Since  the  code  symbols  are  different  linear  functions, 
it  follows  from  Corollary  7.1  that  the  code  is  a  WBE  signal  set.  Now  suppose 
that  this  code  is  extended  to  include  the  zero  function  Tr(0  -C)  as  a  code  symbol. 
This  extra  symbol  is,  in  fact,  an  “overall  parity  check”  since 

g  ■»(«• .  c) = 'li  (e  ■  c)  =  (  (e  “')■<)=  ■ « 

Hence,  this  extended  cyclic  code  is  also  a  WBE  signal  set  which  proves 

Proposition  8.  If  h{z)  is  an  irreducible  polynomial  of  degree  k  and  period  N 
overGF{p),  then  the  {N,k)  cyclic  code,  and  the  {N  +  l,k)  extended  cyclic  code, 
with  parity-check  polynomial  h{z)  are  WBE  signal  sets. 

When  the  parity-check  polynomial  h{z)  is  a  primitive  polynomial  of  degree  k, 
the  cyclic  (p*  - 1 ,  Ife)  code  is  called  a  maximal-length  sequence  code  (henceforth  m- 
sequence  code),  and  the  (p^,^)  extended  code  is  called  an  extended  m-sequence 
code.  Binary  m-sequence  codes  and  extended  binary  m-sequence  codes  are  WBE 
signal  sets  [14].  In  fact,  these  sets  (and  more  generally,  p-ary  m-sequence  codes 
and  extended  m-sequence  codes)  are  MWBE  signal  sets.  Note  also  that  the  signal 
matrices  of  extended  binary  m-sequence  codes  are  2*  x  2*  Hadamard  matrices. 

Proposition  9.  Ifh{z)  is  a  primitive  polynomial  of  degree  k  overGF{p),  then 

—  m-sequence  code:  the  (p*  — l,fe)  cyclic  code  with  parity-check  polynomial  hl^z'^ 
is  a  nonorthogonal  MWBE  signal  set. 

-  Extended  m-sequence  code:  The  (p^.fc)  extended  cyclic  code  with  parity- check 
polynomial  h{z)  is  an  orthogonal  MWBE  signal  set. 

The  signal  matrix  for  the  extended  m-sequence  code  is  a  p*  x  p*  orthogonal 
matrix  whose  entries  are  complex  p-th  roots  of  unity.  In  contrast,  the  discrete 
Fourier  transform  (FDMA)  signal  matrix  for  this  length  is  a  p  x  p“  matrix 
whose  entries  are  complex  p*-th  roots  of  unity. 

The  m-sequence  codes  and  the  extended  m-sequence  codes  are  we]I-  knov»fn  in 
the  literature  as  simplex  signal  sets  and  orthogonal  signal  sets.  They  also  form 
the  basis  for  the  following  observations  about  WBE  signal  sets  derived  from 
linear  codes. 

Proposition  10,  An  {N,  k)  code  is  a  WBE  signal  set  if  and  only  if  it  is  equiv¬ 
alent  under  column  permutation  to  a  punctured  extended  m-sequence  code  with 
k  information  symbols. 


Both  these  results  are  straightforward  generalizations  of  the  Massey  and  Mitteh 
holzer  [14]  result  that  a  linear  binary  code  C  is  a  WBE  signal  set  if  and  only  if  its 
dual  code  has  no  codewords  of  Hamming  weight  2.  An  equivalent  result  due  to 
Assmus  and  Mattson  is  that  linear  binary  codes  satisfying  the  stated  condition 
have  mean-square  Hamming  weight  N{N  -h  l)/4  (cf.  [34]). 

The  remainder  of  this  section  only  considers  codes  over  finite  fields,  but 
many  of  the  results  also  apply  to  codes  over  Zq.  For  codes  over  GF{p),  it  is 
convenient  to  think  of  the  k  information  symbols  as  an  element  C  of  the  extension 
field  GF{p^).  Since  any  linear  function  from  GF{p^)  to  GF{p)  is  of  the  form 
^  Tr(a  •  z)  where  Tr(z)  =  z  +  2:^  +  +  .  •  •  +  is  the  trace  function  from 

GF{p^)  to  GF{p)  and  a  G  GF{p^)  [13],  a  codeword  in  this  linear  code  can  be 
expressed  as 


-  (Tr(ai  •C),Tr(o^2  -O^TrCofs  -Oj-  •  ,Tr(aiv  *0) 

where  ai,  a2,  •  •  •  ,  Q^iV  s^re  N  different  elements  of  GF{p^).  Note  that  at  most  one 
of  the  ai  can  be  0. 

It  is  convenient  to  simply  call  C  a  WBE  signal  set  whenever  S'(C)  is  a  WBE 
signal  set.  A  coset  of  C  is  a  +  C  =  {a  -h  c  :  c  G  C}  where  a  is  a  vector  of 
length  N  over  GF{p).  Then,  S{a  +  C)  =  S{C)U  where  is  a  unitary  matrix.  From 
Proposition  3,  it  follows  that  if  C  is  a  WBE  (MWBE)  signal  set,  then  so  is  a  +  C, 
This  is  a  minor  generalization  of  a  result  noted  in  [14]  for  binary  codes. 

Corollary  7.2,  Cosets  of  codes 

-  If  C  is  a  WBE  signal  set^  then  any  coset  of  C  is  a  WBE  signal  set,  ^ 

-  If  C  is  a  MWBE  signal  set,  then  any  coset  of  C  is  a  MWBE  signal  set. 


From  Proposition  2  it  follows  that 

Corollary  7.3.  IfC  is  a  WBE  signal  set,  then  the  union  of  any  number  of  cosets 
of  C  is  a  WBE  signal  set. 

In  fact,  since  the  entire  space  [GF{p)]^  is  a  union  of  cosets  of  C,  we  have 

Corollary  7.4.  The  {N,N)  code  overGF{p)  is  a  WBE  signal  set. 

Of  course,  this  result  is  also  immediate  from  Corollary  7.1  since  each  codeword 
symbol  in  this  code  is  a  different  information  symbol.  Notice  that  /max  =  N  for 
this  WBE  signal  set  even  though  /rms  <  ViV,  thus  showing  once  again  that  the 
WBE  property  does  not  guarantee  small  values  of  /max* 

The  only  codes  excluded  by  the  hypotheses  of  Corollary  7.1  are  those  with 
repeated  symbols  (whether  zero  or  nonzero).  Codes  with  repeated  symbols  are 
usually  of  interest  only  for  very  noisy  channels;  generally,  better  performance 
is  obtained  if  different  parity  check  equations  specify  the  various  code  symbols. 
Thus,  colloquially.  Corollary  7.1  says  that  almost  all  interesting  codes  are  WBE 
signal  sets.  Some  especially  interesting  codes  are  considered  next. 


Proposition  6.  There  exists  a  binary  WBE  signal  set  of  size  K  if  and  only  if 
K  equals  2  or  is  a  multiple  of  4. 

K  X  K  Hadamard  matrices  provide  binary  MWBE  signal  sets  of  size  K  and 
length  K.  However,  even  for  those  multiples  of  4  for  which  the  existence  of  a 
/{■  X  K  Hadamard  matrix  has  not  been  demonstrated,  it  is  possible  to  construct 
a  binary  WBE  signal  set  of  size  K  and  length  N  <.  K. 

More  generally,  linear  codes  are  the  source  of  large  classes  of  WBE  signal 
sets  over  small  alphabets,  and  these  are  considered  next. 


2.3  WBE  Signal  Sets  from  Linear  Codes 

Let  C  denote  a  linear  [N,  k)  code  over  GF{p),  p  a  prime,  with  the  property  that 
each  code  symbol  Cm  is  a  different  linear  function  of  the  k  information  symbols. 
Since  there  are  only  p*'  different  linear  functions  of  k  variables,  it  must  be  that 
p'^  >  N.  Let  5(C)  denote  the  p^  x  N  matrix  whose  rows  are  the  p''  codewords 
of  C.  Since  the  code  symbols  are  linear  functions,  then  for  any  given  column  of 
'see),  either  each  element  of  GF{p)  occurs  p*~^  times  in  the  column,  or  every 
entry  in  the  column  is  zero.  The  latter  occurs  if  and  only  if  the  corresponding 
code  symbol  is  the  zero  function.  Since  different  code  symbols  must  be  different 
functions,  5(C)  can  have  at  most  one  all-zero  column.  For  m  ^  n,  the  code 
symbols  Cm  and  Cn  are  different  linear  functions,  and  hence  Cm  Cn  is  a 
nonzero  linear  function  of  the  information  symbols.  It  follows  that  the  difference 
of  two  columns  of  5(C)  has  the  property  that  each  field  element  occurs  p''"^ 
times  in  the  difference. 

In  communications  applications,  codewords  are  mapped  into  complex- valued 
vectors  by  mapping  x  E  GF(p)  onto  exp(j2'jrx/p).  This  mapping  is  used  to 
convert  5(C)  into  a  signal  matrix  5(C)  whose  elements  are  the  complex  p-th 
roots  of  unity.  The  inner  product  of  two  columns  is 

pfc  p-i 

(y("‘),y("))  =  ^exp(y27r(C^)  -  C«)/p)  =  p''"^  ° 

.•=1  1=0 


because  takes  on  each  value  0, 1, ...  ,p-  1  an  equal  number  of  times 

as  i  varies  from  1  to  p* ,  and  the  sum  of  the  complex  p-th  roots  of  unity  is  0. 

All  of  the  above  also  applies  to  linear  codes  over  the  integer  ring  Eg  with 
the  minor  modification  that  in  some  cases,  Cm^  —  Cn^  may  take  on  all  values 
0, 1, . . .  ,  r  -  1  in  a  subring  Zr  equally  often.  Hence,  the  following  result  holds: 

Proposition  7.  (Helleseth  and  Kumar  [8]):  IfC  is  a  linear  (N,  k)  code  over  Zg 
such  that  the  code  symbols  Cm  are  different  linear  functions  of  the  information 
symbols,  then  S{C)  is  a  WBE  signal  set  of  size  K  =  q^  and  length  N . 

Corollary  7.1.  Proposition  7  holds  for  linear  codes  over  GF{p). 


Proposition  4.  Let  X  denote  a  WBE  signal  set  of  size  K  and  length  N.  If  all 
the  entries  in  a  column  of  S{X)  are  complex  numbers  of  unit  magnitude,  then 
the  set  X  obtained  by  deleting  this  column  of  S(X)  is  a  WBE  signal  set  of  size 
K  and  length  N  —  1. 

Proposition  4  can  be  applied  to  the  FDMA  signal  set  but  not  to  the  TDMA 
signal  set  described  above.  Also,  deletions  of  columns  generally  do  not  preserve 
the  MWBE  property. 

Propositions  2  and  3  together  allow  the  design  of  very  large  WBE  signal  sets 
with  distinct  signals  (or  WBE  signal  multisets  if  one  so  wishes.)  For  all  of  these, 
Irms  equals  the  right  side  of  (4)  which  is  slightly  smaller  than  \/7V-  However,  this 
does  not  mean  that  all  the  inner  product  magnitudes  are  smaller  than  a/TV.  In 
fact,  /max  can  be  considerably  larger  than  y/N.  For  example,  /max  =  LI  for  the 
union  of  two  TDMA  (or  FDM.^  WBE  signal  sets.  More  generally,  a  result  in 
[33]  shows  that  /max  exceeds  y/N  for  large  signal  sets,  even  though  ■^rms  <  y/N 
for  a  WBE  signal  set. 

The  union  construction  of  Proposition  2  requires  the  WBE  sets  to  be  of  the 
same  length.  It  is  also  possible  to  combine  WBE  sets  of  different  lengths  via  the 
Kronecker  product  construction  to  produce  a  larger  WBE  signal  set  with  longer 
signals.  Recall  that  if  ^  and  B  are  KxN  and x  matrices  respectively,  then 
their  Kronecker  (or  tensor)  product  is  the  KK  x  NN  matrix  A®  B  given  by 

•  02, 7V^ 

•  aK^NB  ^ 

WBE  signal  sets  retain  the  WBE  property  when  combined  via  the  Kronecker 
product  construction. 

Proposition  5.  If  S{X)  and  S{X)  are  WBE  signal  sets  of  sizes  K  and  K 
respectively  and  lengths  N  and  N  respectively f  then  S{X)  0  S{X)  is  a  WBE 
signal  set  of  size  KK  and  length  NN , 

Corollary  5.1.  The  Kronecker  product  of  two  MWBE  signal  sets  is  an  MWBE 
signal  set  if  and  only  if  the  constituent  MWBE  signal  sets  are  orthogonal  MWBE 
signal  sets, 

2.2  WBE  Signal  Sets  over  Small  Alphabets 

The  construction  of  WBE  signal  sets  over  small  alphabets  is  of  importance  for 
practical  applications.  The  essential  results  for  binary  signals  are  given  in  [14]. 
In  this  chapter,  these  results  are  re-stated  and  generalized  to  larger  alphabets. 

Consider  first  that  the  inner  product  of  two  signals  of  length  N  over  the  bi¬ 
nary  alphabet  {-fl,  —1}  equals  N  -2d  where  d  is  the  Hamming  distance  between 
the  signals. 


A®  B  = 


ai^iB  ai^2B 
a2,iB  o.2y2B 

[ok.I-R  0>K,2B 


It  follows  from  (1)  and  (3)  that  for  a  signal  set  of  size  K  and  length  N, 

/rms  > 

which  is  a  form  of  Welch’s  bound.  Since  /max  >  /rms)  Welch  bound  [33]  is 
usually  stated  as 

/max  =  max|{x(''^a;('))|  >  (5) 

If  equality  holds  in  (1),  then  the  signal  set  is  said  to  meet  the  Welch  bound  (4) 
on  /rms  with  equality,  and  is  called  a  WBE  signal  set  [14].  Similarly,  signal  sets 
which  meet  the  Welch  bound  (5)  on  /max  with  equality  can  be  called  MWBE 
signal  sets.  The  inner  product  magnitudes  of  MWBE  signal  sets  satisfy 

|(a;(*),x('^)|  =  for  all  ki^l. 

Examples  of  MWBE  signal  sets  are  the  AT  x  AT  discrete  Fourier  transform  matrix 
with  {k,  lyth  element  exp{j2nkl/N)  where  j  =  V^.  and^iV  •  I  where  I  is  the 
N  X  N  identity  matrix.  These  sets  contain  N  orthogonal  signals  of  length  N  and 
will  be  called  orthogonal  MWBE  signal  sets.  These  orthogonal  MWBE  signal 
sets  correspond  to  frequency-division  multiple-access  (FDMA)  and  time-division 
multiple-access  (TDMA)  signaling  respectively. 

Proposition  2.  (Massey  and  Mittelholzer  [14])-  If  X  and  X  are  WBE  signal 
sets  of  length  N  and  sizes  K  and  K,  then  X  yj  X  is  a  WBE  signal  set  of  size 
K  +  K  and  length  N. 

Note  that  it  is  not  necessary  that  X  and  X  be  different  signal  sets.  For  exam¬ 
ple,  if  the  TDMA  (or  FDMA)  signal  set  is  used  for  both  X  and  X ,  then  the 
larger  (multi)set  has  2N  signals  of  length  N.  All  inner  products  are  0  except  for 
(x(fe),x(*))  =  N,  1  <  it  <  a:,  and  both  sides  of  (1)  have  value  4At®.  However, 
/max(<^  U  X)  =  N.  These  signaling  schemes  in  effect  divide  the  2N  transmitters 
into  N  pairs  with  each  pair  using  its  allotted  time  slot  (or  frequency  slot)  as  a 
two-user  adder  channel.  As  a  final  comment,  note  that  the  union  of  two  MWBE 
signal  sets  is  generally  not  a  MWBE  signal  set. 

Given  a  WBE  signal  set  X  of  size  K  and  length  N,  Proposition  2  allows  the 
construction  of  a  AVBE  signal  multiset  of  size  LK  as  the^  union  of  L  copies  of 
A”.  It  is  also  possible  to  transform  X  into  a  different  set  X  so  as  to  ensure  that 
the  signals  in  X  X  are  distinct.  Several  “WBE-preserving”  transform atioDS 
such  as  cyclic  shifts,  multiplication  of  rows  or  columns  of  5(A')  by  constants  of 
unit  magnitude,  etc.,  are  described  in  [14].  The  next  two  propositions  describe 
general  transformations  that  include  all  these  WBE-preserving  transformations 
as  special  cases. 

Proposition  3.  If  S{X)  is  a  WBE  (MWBE)  signal  set  of  size  K  and  length  N , 
then  DS{X)U,  where  D  is  a  K  x  K  diagonal  unitary  matrix  and  U  is  a  unitary 
matrix,  is  a  WBE  (MWBE)  signal  set  X  of  size  K  and  length  N. 


2  Bounds  on  Inner  Products 


2.1  WBE  Signal  Sets 

Let  X  =  . . .  ,  denote  a  set  of  K  complex-valued  vectors  of  length 

N  where,  to  avoid  special  cases,  it  is  assumed  that  K  >  N  >  1.  The  vectors  in  X 
need  not  be  distinct,  that  is,  AT  can  be  a  multiset.  The  signal  matrix  S{X)oiX  is 

defined  to  be  the  K  x  N  matrix  whose  k-ih.  row  is  =  (a?o  ^  \  •  •  •  » 

I  <  k  <  K,  The  set  X  will  be  called  a  signal  set  of  size  K  and  length  N 
if  all  the  rows  of  S{X)  in  X  have  energy  N,  that  is,  = 

^  all  it,  1  <  fc  <  K.  Conversely,  any  K  x  N  matrix  whose 
rows  all  have  energy  N  specifies  a  signal  set  X  of  size  K  and  length  N.  The 
maximuTn  inner  product  magnitude  /max(<^)  of  a  signal  set  X  is  defined  by 


The  root-mean-square  (RMS)  inner  product  magnitude  /rms(-^)  is  the  square 
root  of  the  average  value  of  x(0)|2  over  all  k^L  Thus, 


-frms(^)  “■ 


K  K 


-1  1/2 


E 

f  1=1 


When  the  signal  set  is  obvious  from  the  context,  it  is  convenient  to  just  write 
/jnax  Sllld  Irms’  Note  also  that  /max  ^  /rms- 

Proposition  1.  (Massey  and  Mittelholzer  [14])-  For  any  signal  set  X ^ 


K  K 

(1) 

*=1 <=i 

with  equality  if  and  only  if  the  columns  y^^\y^^\  •  •  •  ,  y^^  of  S{X)  are  orthog¬ 
onal  vectors  of  energy  K,  that  is^ 


KyTn  —  n^ 

0, 


Welch  [33]  used  a  different  argument  to  prove  the  more  general  result 


K  K 


k  =  l  1=1 


(3) 


Note  that  (1)  is  the  special  case  s  =  1  of  (3).  However,  the  condition  for  equality 
stated  in  Proposition  1  was  not  known  before. 


title  and  the  statement  of  the  main  results  in  [33].  Recently,  however,  Massey 
and  his  colleagues  [14],  [20],  have  considered  sets  of  vectors  for  which  the  RMS 
magnitude  of  the  inner  product  equals  the  Welch  lower  bound.  The  capacity 
of  code-division  multiple-access  (CDMA)  systems  in  which  these  so-called  WBE 
signal  sets  are  used  as  the  signature  sequences  has  been  studied  in  [20].  Following 
this,  other  authors  [31],  [32]  have  considered  the  signal-to-interference  ratio  in 
CDMA  systems  when  WBE  signal  sets  are  used  as  signature  sequences. 

In  this  chapter,  both  the  RMS  magnitude  and  the  maximum  magnitude  of 
the  inner  product  are  considered.  Signal  sets  whose  maximum  inner  product 
magnitude  equals  the  Welch  bound  (hereafter  called  MWBE  signal  sets)  form 
a  subclass  of  WBE  signal  sets,  and  are  also  described.  The  results  in  [14]  are 
generalized  in  several  obvious  ways  from  linear  binary  codes  and  their  cosets  to 
linear  nonbinary  codes  and  their  cosets,  and  from  WBE  signal  sets  to  MWBE 
signal  sets.  It  is  shown  that  almost  all  interesting  codes  are  WBE  signal  sets  - 
indeed,  in  stark  contrast  to  the  folk  theorem  that  “all  codes  are  good  except  the 
ones  we  can  think  of’  one  can  assert  that 

“All  codes  are  WBE  signal  sets  except  the  ones  we  don’t  want  to  think  of!” 

Some  of  the  WBE  signal  sets  described  in  [14]  are  actually  MWBE  signal  sets. 
Large  sets  of  signals  whose  maximum  inner  product  magnitudes  are  only  very 
slightly  larger  than  the  Welch  bound  are  also  described.  All  these  constructions 
provide  signal  sets  whose  sizes  and  alphabets  are  restricted  in  various  ways. 
A  new  result  in  this  chapter  is  a  Kronecker  product  construction  that  provides 
WBE  signal  sets  of  arbitrary  size.  A  construction  of  WBE  signal  sets  over  alpha¬ 
bets  whose  size  is  a  prime  power  is  also  described.  The  most  important  practical 
application  of  the  latter  result  is  to  the  design  of  signal  sets  over  quaternary 
alphabets  for  use  in  quadriphase  communication  systems.  Next,  it  is  noted  that 
subsets  of  WBE  signal  sets  generally  are  not  WBE  signal  sets.  The  effect  of 
this  on  system  analyses  and  information-theoretic  analyses  of  channel  capacity 
is  briefly  described. 

For  asynchronous  CDMA  communication  systems,  various  correlation  pa¬ 
rameters  of  the  signal  sets  are  more  important  than  the  inner  products.  The 
Welch  lower  bounds  [33]  on  the  RMS  and  maximum  magnitudes  of  the  periodic, 
odd,  and  aperiodic  correlation  functions  have  been  studied  previously  [18] ,  [21] , 
[23],  [24].  The  same  bounds  also  apply  to  the  RMS  and  maximum  magnitudes 
of  the  more-recently  introduced  polyphase  correlation  functions  [4],  [28]  which 
include  the  periodic  and  odd  correlation  functions  as  special  cases.  Welch  also 
proved  more  general  bounds  on  the  periodic  and  aperiodic  correlation  function 
magnitudes.  It  is  noted  that  these  also  apply  to  the  polyphase  correlation  func¬ 
tions.  It  is  also  noted  that  signal  sets  (such  as  WBE  signal  sets)  that  meet  the 
Welch  bounds  on  RMS  correlation  are  also  complementary  signal  sets.  Finally, 
the  chapter  considers  the  signal-to-noise  ratio  parameter  [1  i^]  for  asynchronous 
CDMA  communication  systems  that  use  WBE  signal  sets  as  sign  ature  sequences. 
It  is  noted  that  in  many  cases,  better  than  average  performance  can  be  guaran¬ 
teed  if  WBE  signal  sets  are  used.  The  chapter  concludes  with  a  brief  discussion 
of  these  results. 


Meeting  the  Welch  Bound  with  Equality 


Dilip  V.  Sarwate 


Department  of  Electrical  and  Computer  Engineering 
and  the  Coordinated  Science  Laboratory 
1308  West  Main  Street 
Urbana  IL  61801  USA 
Emml:  scirwate@commxsLuiuc.edu 


Abstract.  A  signal  set  whose  root-mean-square  inner  product  magni¬ 
tude  equals  the  Welch  lower  bound  is  called  a  WBE  signal  set.  WBE 
signal  sets  are  of  interest  in  synchronous  CDMA  communication  sys¬ 
tems,  This  chapter  surveys  the  known  results  on  WBE  signal  sets  and 
extends  them  in  several  ways.  In  particular,  WBE  sign2il  sets  over  sig¬ 
nal  alphabets  whose  size  is  a  prime  power  (the  most  important  case  is 
the  quaternary  alphabet),  and  arbitrary-size  WBE  signal  sets  o-ver  small 
alphabets  are  constructed.  Constructions  are  also  described  for  signal 
sets  whose  maximum  inner  product  magnitudes  equal  (or  are  only  very 
slightly  larger  than)  the  Welch  boimd.  Similarly,  signal  sets  whose  root- 
mean- square  correlation  magnitudes  equal  the  Welch  lower  bound  on 
correlations  are  of  interest  in  cisynchronous  CDMA  communication  sys¬ 
tems.  It  is  shown  that  the  root-mean-square  correlation  magnitude  of 
aU  WBE  signal  sets  (and  many  more)  equals  the  Welch  boimd.  Finally, 
the  signal-to-noise  ratio  for  CDMA  communication  systems  using  WBE 
sign2il  sets  is  studied. 


1  Introduction 

A  quarter-century  ago,  Welch  [33]  published  a  collection  of  lower  bounds  on 
the  maximum  magnitude  of  the  inner  products  of  a  set  of  unit-norm  complex¬ 
valued  vectors,  and  used  these  results  to  deduce  lower  bounds  on  the  maximum 
magnitudes  of  the  periodic  and  aperiodic  correlation  functions  for  sets  of  periodic 
sequences.  The  method  used  was  a  familiar  one  -  Welch  found  lower  bounds  on 
the  root-mean-square  (RMS)  magnitude,  and  then  asserted  that  the  maximum 
magnitude  could  not  be  smaller  than  these  lower  bounds.  In  the  years  that 
followed  the  publication  of  the  Welch  bounds,  the  statement 

‘The  set  satisfies  the  Welch  bound  with  equality” 

was  generally  understood  [1],  [8],  [10],  [15],  [21]  to  mean  that  the  maximuTn 
magnitude  of  the  parameter  under  consideration  was  equal  to  the  corresponding 
lower  bound  given  in  [33].  Indeed,  this  interpretation  is  supported  by  both  the 
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Figure  4.  Plot  of  optimal  An  versus  Bmax  in  the 
linear  case 


Figure  5.  Plot  of  optimal  An  versus  Bmax  in  the 
2-D  case 


subject  to  Pb  <  Bmax  (2) 

A  similar  optimization  problem  is  defined  for  the  FCA 
scheme,  except  that  An  is  the  only  tuning  parameter 
in  that  case.  In  Figure  4,  we  plot  the  optirnal  values 
of  An  versus  Bmax  for  both  schemes  for  the  linear  case 
with  r  =  2  and  r  =  4,  In  Figure  5,  we  plot  the  optimal 
values  of  An  versus  Bmax  for  both  schemes  for  the  2-D 
case  with  i?  =  3,  7,  12,  and  19.  For  both  figures,  we  use 
//  =  1.  To  make  our  results  comparable  for  different 
minimum  reuse  factors,  we  keep  N/r  =  15  for  all  the 
curves  in  Figure  4.  Thus,  iV  =  30  when  r  =  2,  and 
AT  =  60  when  r  =  4.  Similarly,  in  Figure  5,  we  set 
N/R  =  15  for  all  R. 

From  Figures  4  and  5,  we  observe  that  for  any 
Pb  ^  Bmax  constraint,  our  channel  sharing  scheme  al¬ 
lows  a  higher  call  rate  than  the  FCA  scheme.  For  a 
typical  constraint  of  <  10“^,  our  scheme  can  admit 
approximately  20%  more  calls  into  the  network  than 
the  FCA  scheme  with  r  =  4  in  the  linear  case  or  with 
P  =  7  in  the  2-D  case.  Note  that  in  the  2-D  case, 
even  with  the  minimum  possible  reuse  factor  ii  =  3, 
our  scheme  outperforms  the  FCA  scheme  by  more  than 
10%.  Of  course,  when  the  reuse  factor  becomes  larger, 
the  improvement  also  increases  (e.g.,  30%  for  ii  =  19). 
Also,  as  shown  in  the  figures,  for  lower  blocking  prob¬ 
ability  constraints,  the  improvement  is  even  greater. 


We  next  investigate  the  protocol  complexity  for 
those  optimal  combined  sharing  schemes  obtained  for 
Figures  4  and  5.  We  first  estimate  the  expected  number 
of  inter-cell  communications  used  to  serve  a  call  arrival, 
by  dividing  the  total  number  of  sharing  requests  that 
have  been  submitted  in  simulation  to  the  total  num¬ 
ber  of  call  arrivals.  Here  we  count  all  sharing  request 
signals,  including  those  that  are  not  granted.  We  find 
that  the  expected  number  of  sharing  requests  per  call 
arrival  does  not  exceed  0.2  in  all  the  cases  shown  in 
Figure  4,  and  0.6  for  those  in  Figure  5.  There  is  a  dif¬ 
ference  between  the  linear  and  2-D  cases,  because  in 
the  latter,  each  cell  has  more  adjacent  cells,  and  thus 
more  sharing  requests  have  to  be  sent.  We  next  esti¬ 
mate  the  expected  number  of  intra-cell  channel  swap¬ 
pings  needed  to  serve  a  call  arrival,  by  dividing  the 
total  number  of  channel  swappings  observed  in  simula¬ 
tion  to  the  total  number  of  call  arrivals.  We  find  that 
the  maximum  number  of  channel  swappings  per  call 
arrival  needed  is  less  than  0.04  for  the  combined  shar¬ 
ing  schemes  in  Figure  4,  and  less  than  0.25  for  those  in 
Figure  5,  These  results  suggest  that  the  computational 
overhead  to  implement  our  protocol  is  not  significant. 

5.  Conclusion 

We  have  presented  a  novel  channel  sharing  scheme 
to  improve  system  capacity  in  wireless  cellular  sys¬ 
tems.  Our  basic  idea  is  to  allow  channels  to  be  shared 
between  adjacent  cells  without  co-channel  coordina¬ 
tion  with  other  cells.  For  this  purpose,  we  introduce 
the  concept  of  meta-cells  to  facilitate  localized  channel 
management.  Further,  to  maximize  channel  reuse  eflB- 
ciency,  we  develop  a  channel  assignment  method  based 
on  the  distance  measure  between  meta-cells.  An  at¬ 
tractive  feature  of  the  channel  sharing  scheme  is  that 
it  does  not  require  complex  power  control  techniques, 
global  channel  coordination,  or  on-line  optimization, 
which  simplifies  its  implementation.  In  this  paper,  we 
only  consider  the  channel  assignment  problem  with¬ 
out  taking  into  account  call  handoff.  In  [3],  we  de¬ 
scribe  how  the  channel  sharing  scheme  can  be  applied 
to  address  the  call  handoff  problem.  We  find  that  our 
scheme  significantly  reduces  handoff  blocking  probabil¬ 
ity  and  improves  system  capacity  over  a  large  range  of 
traffic  conditions. 
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assigned  an  idle  channel,  and  the  state  of  that  channel 
is  then  changed  to  “BUSY.”  Otherwise,  we  atternpt 
to  obtain  an  idle  accessible  channel  from  some  sharing 
cell,  as  follows.  We  first  look  up  all  accessible  chan¬ 
nels  currently  labeled  “DISABLED”  in  the  local  cell, 
and  get  a  list  of  the  sharing  cells  associated  with  those 
“DISABLED”  accessible  channels.  We  then  choose  one 
of  the  sharing  cells  on  this  list,  and  send  it  a  sharing 
request.  The  sharing  cell,  upon  receiving  the  sharing 
request,  then  executes  the  procedure  described  below. 
If  the  sharing  request  is  granted,  we  accept  the  arriving 
call  by  assigning  the  accessible  channel  specified  in  the 
return  message  from  the  sharing  cell,  and  setting  the 
state  of  that  channel  “BUSY”.  If  the  request  is  denied, 
we  proceed  to  send  another  sharing  request  to  one  of 
the  remaining  sharing  cells  on  the  list.  The  process  is 
continued  until  the  sharing  request  is  accepted,  or  de¬ 
nied  by  all  sharing  cells  on  the  list,  in  which  case  the 
call  is  rejected  (blocked). 

When  a  sharing  request  is  received  from  a  sharing 
cell,  we  first  check  whether  there  is  any  accessible  chan¬ 
nel  U,  associated  with  that  sharing  cell,  currently  la¬ 
beled  “IDLE”  in  the  local  cell.  If  so,  we  grant  the  shar¬ 
ing  request  by  sending  back  the  identification  number 
of  channel  U  in  the  return  message,  and  setting  channel 
U  “DISABLED”  in  the  local  look-up  table.  Otherwise, 
we  attempt  to  obtain  such  an  idle  channel  U  by  swap¬ 
ping  channels  as  follows.  We  check  whether  there  is 
any  accessible  channel  V  currently  labeled  “IDLE”  in 
the  local  cell.  If  so,  we  find  a  user  currently  using  an 
accessible  channel  U  associated  with  the  sharing  cell. 
That  user  then  releases  channel  U  and  grabs  channel 
Vj  so  that  we  can  grant  the  sharing  request  by  return¬ 
ing  the  identification  number  of  channel  U,  and  setting 
channel  U  as  “DISABLED”  in  the  local  look-up  table. 
If  no  accessible  channels  are  in  the  “IDLE”  state,  we 
reject  the  sharing  request. 

When  a  call  leaves  the  local  cell,  we  simply  label  the 
associated  channel  “IDLE.” 

The  following  are  some  of  the  important  features  of 
our  channel  sharing  protocol. 

1.  Channel  sharing  within  a  meta-cell  is  localized  be¬ 
tween  two  adjacent  cells  and  can  therefore  be  done 
in  a  decentralized  fashion.  No  global  coordination 
is  necessary  in  our  protocol.  The  major  computa¬ 
tional  effort  in  the  local  cell  includes  communica¬ 
tion  with  the  sharing  cells  and  channel  swappings. 

2.  Channel  utilization  is  improved^  because  channels 
are  assigned  to  meta-cells,  instead  of  being  as¬ 
signed  to  cells.  Specifically,  an  idle  channel  can  be 
accessed  by  calls  in  either  of  its  component  cells, 
thus  effectively  reducing  the  fraction  of  idle  peri¬ 
ods  of  channels. 

3.  Call  blocking  may  be  decreased^  since  users  have 

access  to  statistically  more  portion  of  channel  ca¬ 
pacity.  For  example,  in  the  linear  cellular  system, 
each  cell  is  covered  by  two  overlapping  meta-cells. 
Users  in  each  cell  may  thus  have  access  to  ^  chan¬ 
nels  (at  most),  compared  with  y  channels  using 
the  FCA  scheme.  Note  that  ^  >  y,  since 

r  >  2.  Therefore,  our  scheme  can  provide  more 


channel  capacity  than  the  FCA  scheme.  It  can  be 
shown  that  a  similar  result  holds  for  the  2-D  case 
as  well. 

4.  Combined  Scheme  and  Simulation  Results 

It  is  instructive  to  compare  the  effective  channel  ca¬ 
pacity  using  our  sharing  scheme  and  the  FCA  scheme. 
As  we  pointed  out  above,  in  an  individual  cell,  the 
number  of  accessible  channels  in  our  scheme  is  gener¬ 
ally  larger  than  that  of  the  FCA  scheme.  Note  that, 
however,  this  number  is  an  upper  bound  for  the  effec¬ 
tive  channel  capacity  since  it  assumes  that  the  statis¬ 
tical  sharing  advantage  can  be  fully  exploited.  On  the 
other  hand,  if  we  totally  ignore  the  statistical  sharing 
advantage,  we  may  then  obtain  a  lower  bound  for  the 
channel  capacity.  The  effective  channel  capacity  using 
our  sharing  scheme  is  bounded  by  these  two  extreme 
scenarios,  and  depends  on  the  extent  to  which  the  sta¬ 
tistical  sharing  advantage  is  actually  obtained. 

Next,  we  describe  a  combined  scheme  that  attempts 
to  maximize  the  effective  channel  capacity  for  each 
cell.  Specifically,  we  divide  the  N  given  channels 
into  two  distinct  groups  of  size  Ni  and  N2  such  that 
N  =  Ni  N2-  The  first  group  of  Ni  channels  is  as¬ 
signed  to  cells  using  the  FCA  scheme,  and  thus  cannot 
be  shared  between  adjacent  cells.  The  remaining  group 
of  N2  channels,  however,  is  assigned  to  meta-cells  ac¬ 
cording  to  our  channel  sharing  assignment  scheme,  and 
can  be  shared  within  meta-cells,  the  same  as  in  the  pre¬ 
vious  sections. 

Varying  N2  allows  us  to  maximize  the  effective  chan¬ 
nel  capacity.  Specifically,  the  parameter  N2  enables 
us  to  trade  off  the  potential  advantage  from  statisti¬ 
cal  sharing  with  a  reduction  in  the  average  number  of 
channels  for  each  cell.  The  larger  the  value  of  N2  in 
the  combined  scheme,  the  more  the  gain  we  expect  from 
statistical  sharing,  but  at  the  same  time  the  higher  the 
price  we  pay  by  reducing  the  average  number  of  chan¬ 
nels  for  each  cell. 

We  next  numerically  compare  the  performance  of 
the  combined  channel  sharing  scheme  with  the  FCA 
scheme.  We  investigate  both  the  linear  and  2-D  hexag¬ 
onal  cases  by  simulation.  Our  simulation  model  for 
the  linear  cellular  system  consists  of  30  cells,  where 
each  cell  has  two  adjacent  cells.  The  boundary  cells 
on  the  two  sides  are  then  connected  to  each  other  to 
avoid  “edge”  effects  at  the  boundaries.  To  model  the  2- 
D  cellular  system,  we  use  a  36-cell  configuration,  with 
each  cell  having  six  adjacent  cells.  Again,  the  cells  on 
the  boundary  of  one  side  of  the  configuration  are  con¬ 
nected  to  the  cells  on  the  boundary  of  the  other  side. 
For  both  the  linear  and  2-D  cases,  calls  are  assumed  to 
arrive  at  each  cell  according  to  a  Poisson  process  with 
rate  A„.  The  time  until  a  call  terminates  is  assumed  to 
be  exponentially  distributed  with  mean  l/fi. 

We  compare  the  maximum  call  arrival  rate  An  that 
can  be  admitted  by  our  channel  sharing  scheme  and 
the  FCA  scheme,  respectively,  for  various  values  of  the 
blocking  probability  Pt-  More  precisely,  we  define  the 
following  optimization  problem  for  our  sharing  scheme: 

maximize  An 
An  >^2 


Figure  3.  Configuration  of  co-channel  meta 
cells  in  the  case  of  p  +  1  >  g. 


radius  of  a  single  cell.  In  general,  the  minimum  reuse 
factor  R  can  be  represented  by  J?  =  +  M  +  where 
p  and  q  are  nonnegative  integers  satisfying  p  +  g  >  2. 
Without  loss  of  generality,  we  assume  that  g  >  p  in  the 
following. 

As  in  the  linear  case,  a  meta-cell  here  consists  of 
two  adjacent  neighboring  cells.  To  allow  for  channel 
sharing,  we  assign  channels  to  meta-cells,  under  the 
minimum  reuse  distance  constraint  in  terms  of  meta¬ 
cells.  For  our  purpose,  the  distance  measure  between 
two  meta-cells  is  defined  in  the  same  way  as  in  Equa¬ 
tion  (1),  and  we  attempt  to  deploy  co-channel  meta¬ 
cells  as  close  as  possible.  In  the  remainder  of  this  sec¬ 
tion,  we  describe  this  channel  assignment  scheme. 

Consider  an  arbitrary  cell  Aq  in  the  hexagonal  cel¬ 
lular  system  as  shown  in  Figure  3,  and  establish  a  set 
of  coordinates  i-j  originating  at  the  center  of  cell  ylg, 
where  the  two  axes  form  a  60^  angle,  and  the  unit 
distance  along  the  axes  is  '\/3p.  Then,  the  minimum 
reuse  distance  A  is  equal  to  R  coordinate  units  in  this 
set  of  coordinates.  The  center  of  any  hexagon  Z  can 
be  designated  by  a  pair  of  coordinates  with 

iz  and  jz  integers.  Let  Bo,  Co,  Dq,  Eq,  Fq,  and  Go 
denote  six  hexagons  with  centers  at  the  coordinates 

(P,?).  (-9,P  +  ^.  (-P  -9.P).  (-P.-?).  (9.-P-9)i 
and  (p  +  g,“*p),  respectively  (see  Figure  3).  Note  that 
their  distances  to  cell  Ao  (centered  at  (0,0))  are  equal 
and  given  by  p^  +  pg  +  g^  =  R  units.  In  the  conven¬ 
tional  FCA  scheme,  cells  Bo,  Go,  Do,  Eo,  Fo,  and  Go 
are  assigned  the  same  set  of  channels  as  cell  Aq  . 

Let  Ai  be  the  adjacent  right  neighboring  cell  of  Aq 
along  the  2-axis  as  shown  in  Figure  3,  i.e.,  (Ui,iAi)  = 
(Uo  d-  ^Jao)  —  (1)0).  We  next  focus  on  meta-cell 
(AojAi),  Note  that  in  the  2-D  case,  meta-cells  may 
have  three  distinct  orientations,  and  each  meta-cell 
may  have  six  neighboring  co-channel  meta-cells.  Con¬ 
sequently,  there  are  an  infinite  number  of  different 
ways  to  deploy  co-channel  meta-cells,  including  irreg¬ 
ular  (non-repetitive)  deployment  patterns.  For  the 
sake  of  simplicity  and  regularity,  here  we  restrict  our¬ 
selves  to  deployment  methods  in  which  all  co-channel 
meta-cells  are  of  the  same  orientation.  In  other  words, 
for  any  co-channel  meta-cell  of  (Ao,Ai),  two  compo¬ 


nents  are  adjacent  along  the  i-axis.  Therefore,  we  can 
identify  any  co-channel  meta-cell  of  reference  meta-cell 
(Ao,  Ai)  by  its  left  component. 

It  turns  out  that  for  different  values  of  p  and  g,  we 
need  different  methods  to  deploy  co-channel  meta-cells 
of  reference  meta-cell  (Aq.Ai),  depending  on  whether 
p+l>  q.  For  simplicity,  here  we  discuss  only  the  case 
of  p  -h  1  >  g.  The  values  of  R  within  this  category  are 
3,7, 12, 19,  etc.  It  can  be  shown  that  in  this  case,  as 
illustrated  in  Figure  3,  the  meta-cells  whose  left  com¬ 
ponents  are  respectively  Fi,  Go,  F_i,  F_i,  Fq,  and 
Gi,  can  be  deployed  as  co-channel  meta-cells  of  refer¬ 
ence  meta-cell  (Aq,  Ai),  where  the  centers  of  cells  Fi, 
D^i,  E^i,  and  Gi  are  given  respectively  as  follows: 
(iBiJBi)  =  (p+1,9).  =  (-p-9-l>P). 

(tE-iJE-i)  =  i-p-  1,-9),  and  [iorJaCi  =  (p+9  + 
\^—p).  Furthermore,  we  have  shown  that  when  as¬ 
signing  channels  according  to  this  co-channel  meta-cell 
deployment,  the  channel  reuse  factor  is 

F'  =  F-I-P  +  g. 

A  detailed  proof  and  further  discussion  of  the  above 
co-channel  meta-cell  deployment  are  given  in  [3]. 

3.  Channel  Sharing  Protocol 

We  next  develop  a  channel  sharing  protocol  to  ex¬ 
ploit  this  sharing  feature  of  our  scheme.  For  the  sake 
of  description,  we  focus  our  attention  on  a  particular 
(arbitrary)  cell,  which  we  call  the  local  cell.  Recall  from 
the  previous  section  that  using  our  channel  assignment 
scheme,  each  cell  in  the  cellular  network  is  always  cov¬ 
ered  by  several  overlapping  meta-cells,  called  the  cov¬ 
ering  meta-cells  of  that  cell.  We  call  channels  that  are 
assigned  to  the  covering  meta-cells  of  the  local  cell  the 
accessible  channels,  since  users  in  the  local  cell  may 
have  access  to  those  channels. 

At  any  given  instance,  each  accessible  channel  can 
be  used  in  only  either  the  local  cell  or  one  of  the  adja¬ 
cent  cells.  In  other  words,  not  all  the  accessible  chan¬ 
nels  can  be  used  by  the  local  cell  at  a  given  instance. 
Those  channels  that  can  be  used  by  the  local  cell  are 
called  enabled  channels.  To  coordinate  the  use  of  ac¬ 
cessible  channels  between  the  local  cell  and  the  corre¬ 
sponding  sharing  cells,  the  local  cell  maintains  a  look¬ 
up  table,  in  which  each  accessible  channel  occupies  one 
entry,  recording  the  current  state  of  that  channel.  The 
state  of  this  entry  may  be  “DISABLED” ,  “IDLE” ,  or 
“BUSY”,  A  channel  labeled  “DISABLED”  is  not  en¬ 
abled  in  the  local  cell.  This  indicates  that  the  channel 
is  currently  enabled  in  the  sharing  cell  associated  with 
that  channel.  The  local  cell  can  use  only  those  ac¬ 
cessible  channels  labeled  “BUSY”  or  “IDLE”.  In  this 
case  the  labels  further  indicate  whether  channels  are 
currently  occupied  by  mobile  users  in  the  local  cell. 

Our  protocol  can  be  described  in  three  main  parts, 
corresponding  to  three  different  possible  scenarios:  ar¬ 
rival  of  a  call,  sharing  request  from  a  sharing  cell,  and 
departure  of  a  call. 

When  a  call  arrives,  we  check  if  there  are  any  accessi¬ 
ble  channels  labeled  “IDLE”  in  the  look-up  table  of  the 
local  cell.  If  there  are,  the  arriving  call  is  accepted  and 
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Figure  1 .  Meta-cells  in  the  1-D  ceiiuiar  system. 


used  in  cell  A! ,  Such  a  channel  borrowing  scheme  re¬ 
quires  excessive  exchange  of  channel  usage  information 
between  remote  cells,  and  channel  utilization  may  also 
be  decreased  at  high  traffic  loads  [1]. 

In  our  scheme,  we  attempt  to  alleviate  call  block¬ 
ing  by  sharing  channels  between  neighboring  cells.  To 
facilitate  the  description  of  our  scheme,  we  first  intro¬ 
duce  the  concept  of  meta-celL  A  meta-cell  is  a  fixed 
collection  of  neighboring  cells  (typically  a  pair  of  two 
adjacent  cells).  For  example,  Figure  1  shows  a  fam¬ 
ily  of  meta-cells  in  a  linear  (1-D)  cellular  system,  each 
comprising  a  pair  of  two  adjacent  cells.  Each  meta¬ 
cell  is  designated  by  a  pair  (A,  Y),  where  X  and  Y 
are  individual  cells  called  the  component  cells  of  that 
meta-cell.  For  example,  in  Figure  1,  cells  A  and  B  are 
components  of  meta-cell  (A,  j5).  As  before,  we  assume 
that  we  have  a  distance  measure  for  meta-cells  (e.g., 
based  on  the  distance  measure  d  between  cells) . 

The  main  idea  of  our  channel  assignment  scheme  is 
to  allocate  channels  to  meta-cells  in  such  a  way  that  a 
maximum  number  of  channels  can  be  assigned  to  each 
meta-cell  while  any  two  meta-cells  assigned  the  same 
channels  satisfy  the  minimum  reuse  distance  require¬ 
ment,  i.e.,  the  distance  measure,  now  with  respect  to 
two  meta-cells,  is  no  shorter  than  A.  Our  channel  as¬ 
signment  scheme,  done  at  the  meta-cell  level,  is  differ¬ 
ent  from  the  usual  channel  assignment  scheme  which 
is  done  at  the  cell  level.  In  particular,  channels  are  as¬ 
signed  not  to  individual  cells  but  to  meta-cells.  A  user 
in  a  given  cell  can  use  any  channel  that  is  assigned  to 
a  meta-cell  to  which  the  given  cell  belongs,  with  the 
usual  proviso  that  only  one  user  can  use  a  given  chan¬ 
nel  within  a  given  meta-cell  at  a  given  instance.  Our 
channel  allocation  scheme  allows  channels  to  be  shared 
between  neighboring  cells  (namely,  cells  belonging  to 
the  same  meta-cell).  Note  that  relative  to  the  usual 
FCA  scheme,  in  our  sharing  scheme  we  need  to  co¬ 
ordinate  between  cells  in  a  meta-cell.  However,  since 
meta-cells  consist  only  of  neighboring  cells,  the  scheme 
should  be  easy  to  implement. 

In  the  following,  we  illustrate  our  channel  assign¬ 
ment  scheme  for  two  typical  cellular  systems:  linear 
and  2-D  hexagonal.  We  develop  methods  to  deploy 
meta-cells  assigned  the  same  channels  cis  close  as  pos¬ 
sible,  while  not  violating  the  minimum  reuse  distance 
requirement. 

Consider  again  Figure  1.  For  this  simple  linear  cel¬ 
lular  system,  the  distance  measure  dx,Y  between  two 
ceils  X  and  Y  is  usually  given  as  dx,Y  =  kx  “  cy|, 
where  cx  and  cy  denote  the  positions  of  the  centers 
of  cells  X  and  Y,  respectively.  Suppose  that  the  mini¬ 
mum  reuse  distance  is  A  =  rL,  where  L  is  the  width  of 
a  single  cell  and  r  is  an  integer.  Cells  that  are  assigned 
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Figure  2.  Meta-cell  based  channel  assignment 
scheme  in  the  1-D  cellular  system. 


the  same  set  of  channels  are  called  co-channel  cells. 
Therefore,  in  the  FCA  scheme,  co-channel  cells  are  ex¬ 
actly  r  cells  apart.  Let  N  denote  the  total  number  of 
distinct  channels  that  are  available  in  this  linear  cellu¬ 
lar  system.  Thus,  the  total  number  of  distinct  channels 
allocated  to  each  cell  is  N/r,  The  integer  r  is  called 
the  reuse  factor. 

To  assign  channels  to  meta-cells,  we  next  derive  the 
distance  measure  d(x,y),(x^yq  between  two  meta-cells 
(X,  Y)  and  (X',Y').  Recall  that  in  our  scheme,  when 
a  channel  is  assigned  to  a  meta-cell,  it  can  be  used 
by  a  mobile  user  in  any  cell  belonging  to  that  meta¬ 
cell.  Thus,  we  have  to  ensure  that  the  distance  mea¬ 
sure  between  any  component  cells  of  two  meta-cells  as¬ 
signed  the  same  set  of  channels  complies  with  the  min¬ 
imum  reuse  distance  requirement.  Consequently,  we 
define  d(x,y),(X',yq  ^  fhe  minimum  of  the  distance 
measures  between  the  component  cells  of  meta-cells 
(X,Y),(X'Y'),i.e., 

d(x,Y),{x\Y^)=  min{dx,x^ ,  dx,ys  dy,xq  cfy,y^}-  (1) 

For  example,  in  Figure  1,  the  distance  measure  between 
meta-cells  (A,J5)  and  (A',  5')  is  given  by  (r  -  1)L, 
which  is  the  distance  between  cells  B  and  A'. 

We  call  meta-cells  that  are  assigned  the  same  set 
of  channels  co- channel  meta- cells.  To  allocate  a  maxi¬ 
mum  number  of  channels  to  each  meta-cell,  co-channel 
meta-cells  have  to  be  deployed  as  close  as  possible. 
Therefore,  we  assign  the  same  set  of  channels  to  meta¬ 
cells  that  are  exactly  the  minimum  reuse  distance 
apart,  i.e.,  rL  in  this  case.  For  example,  in  Figure  2, 
meta-cells  (A,  B)  and  {B',C^)  are  assigned  the  same  set 
of  channels.  It  is  easy  to  show  that,  using  our  scheme, 
each  meta-cell  is  assigned  N/ (r  -j- 1)  distinct  channels. 
In  other  words,  the  reuse  factor  of  our  channel  assign¬ 
ment  scheme  is 

r'  =  r  H-  1. 

In  the  planar  or  2-D  case,  the  cellular  configuration 
is  usually  assumed  to  be  hexagonal.  As  in  the  linear 
case,  the  distance  dx,Y  between  two  cells  X  and  Y 
is  usually  defined  as  the  distance  between  their  cen¬ 
ters.  Let  N  denote  the  total  number  of  distinct  chan¬ 
nels  available  in  this  hexagonal  cellular  system.  In  the 
FCA  scheme,  co-channel  cells  are  exactly  a  distance  of 
A,  the  minimum  reuse  distance,  apart.  Consequently, 
each  cell  is  assigned  N/R  channels,  where  R  is  the  min¬ 
imum  reuse  factor.  It  has  been  shown  that  R  can  be 
determined  from  Ahy  R=  (1/3)(A//?)^,  where  p  is  the 
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Abstract 

Enhancing  system  capacity  is  an  important  issue  in 
wireless  cellular  networks.  In  this  paper^  we  present 
a  new  channel  sharing  scheme  to  address  this  problem. 
Our  basic  idea  is  to  allow  channels  to  be  shared  between 
adjacent  cells.  We  propose  a  fixed  channel  assignment 
scheme  to  maximize  channel  reuse  efficiency  while  al¬ 
lowing  channel  sharing.  An  important  feature  of  our 
sharing  scheme  is  that  channel  management  is  local¬ 
ized  between  adjacent  cells^  and  no  global  coordination 
or  optimization  is  required^  thus  simplifying  implemen¬ 
tation.  We  provide  simulation  results  comparing  our 
scheme  with  the  fixed  channel  assignment  scheme. 


1.  Introduction 

Worldwide  focus  on  wireless  networking  research  has 
intensified  in  recent  years.  Compared  to  its  wired  coun¬ 
terpart,  wireless  spectrum  is  a  much  inore  scarce  re¬ 
source.  Thus,  enhancing  system  capacity  is  of  great 
importance  in  wireless  networks.  The  use  of  cellular 
technology  is  a  common  means  to  this  end.  In  a  cel¬ 
lular  system,  the  service  area  is  divided  into  cells,  and 
the  wireless  spectrum  is  reused  among  those  cells.  We 
refer  to  the  unit  of  wireless  spectrum  needed  to  serve  a 
single  user  as  a  channel.  For  example,  in  a  TDMA  sys¬ 
tem,  a  time-slot  is  viewed  as  a  channel.  We  adopt  the 
usual  assumption  that  channels  used  in  one  cell  cannot 
be  used  in  other  cells  that  are  closer  than  the  minimum 
reuse  distance.  A  significant  body  of  research  has  been 
conducted  on  efficiently  allocating  channels  to  individ¬ 
ual  cells  under  this  minimum  reuse  distance  constraint 
[2].  There  are,  in  general,  two  types  of  channel  allo¬ 
cation  schemes:  Fixed  Channel  Allocation  (FCA)  and 
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Dynamic  Channel  Allocation  (DCA). 

In  this  paper,  we  propose  a  novel  channel  sharing 
scheme  to  improve  system  capacity  in  wireless  cellular 
networks.  The  basic  idea  in  our  channel  sharing  scheme 
is  to  allow  channels  to  be  shared  between  adjacent  cells; 

1. e.,  channels  can  be  used  by  any  user  in  either  cell, 
without  coordinating  with  other  cells  for  the  use  of  the 
same  channels.  In  this  way,  channel  management  can 
be  localized  within  adjacent  cells.  Further,  to  avoid 
the  need  for  centralized  optimization,  we  propose  a  new 
fixed  channel  assignment  scheme  that  attempts  to  keep 
the  co-channel  reuse  distance  as  close  as  possible  while 
allowing  channel  sharing. 

2.  Localized  Channel  Sharing  Assignment  Scheme 

Consider  a  set  of  cells  in  a  cellular  system.  Each 
cell  contains  a  base  station,  which  communicates  with 
mobile  users  in  that  cell.  Associated  with  the  cellular 
system  is  a  set  of  channels,  which  are  to  be  allocated  to 
the  cells.  In  each  cell,  an  individual  channel  can  be  used 
by  only  one  mobile  user  in  the  cell  for  communication 
with  the  base  station.  The  same  channel  can  be  reused 
in  two  different  cells  as  long  as  they  (the  cells)  satisfy  a 
minimum  reuse  distance  requirement.  To  elaborate,  we 
assume  that  we  are  given  a  distance  measure  d,  where 
dx,Y  is  the  distance  between  cells  X  and  Y .  We  are 
also  given  a  parameter  A  that  represents  the  minimum 
reuse  distance.  Two  cells  X  and  Y  are  said  to  satisfy 
the  minimum  reuse  distance  requirement  if  dx,Y  >  A. 

In  the  conventional  FCA  scheme,  each  channel  is  as¬ 
signed  to  cells  that  are  exactly  a  distance  A  apart.  For 
example,  cells  A  and  A'  are  assigned  the  same  set  of 
channels.  In  cellular  systems  using  the  FCA  scheme, 
calls  in  a  cell  can  only  use  those  channels  assigned  to 
that  cell.  Call  blocking  results  if  such  a  channel  is  not 
available.  Channel  borrowing  is  one  way  to  reduce  call 
blocking.  Specifically,  if  at  the  time  the  call  arrives 
at  cell  B,  cell  B  does  not  have  any  idle  channel,  but 
cell  A  has  some  idle  channels,  then  cell  B  may  bor¬ 
row  one  of  the  idle  channels  to  serve  the  new  call.  To 
achieve  this,  however,  we  need  to  coordinate  the  use  of 
channels  in  the  co-channel  cells  of  cell  A.  For  exam¬ 
ple,  channels  that  are  borrowed  by  cell  5,  cannot  be 


Algorithm 

Compare 

Add 

Multiply 

Divide 

Exp 

Storage 

OSOME 

1 

2^-\2Ay  +  L-K  +  4) 

2^{L-K-\-2) 

1 

2^ 

2^-^[f  +  fiL-K  +  l)] 

SSOME 

2^-\L-  K  +  2) 

2^-^{2A^+2L-2K  +  Q) 

0 

1 

2^ 

2^-^[f  +  {f  +  l)(L-K  +  l)] 

SSOMEl 

2^-\L-K  +  3) 

2^-\2Av+2L-2I<  +  4) 

0 

1 

1 

2^-nf  +  {f  +  l)iL-I<  +  l)] 

SOVA 

2^-i(2A  +  3e  +  4) 

2K-1 

2K-1 

2^-^[f  +  {f  +  l){L-K  +  l)] 

BCDFE 

QB&jSQfln 

2-' (2  A +4) 

2^if  +  L-J) 

MBCDFE 

2-'(A  -  J  -  1)  +  1 

2^{2A^  +  L-J  +  A) 

2-'(2L-2J  +  3) 

2J+1 

2-'[f{L-J  +  l)  +  K-J-l] 

Table  1:  Complexity  in  number  of  operations  per  iteration  (/  =  #bits  required  to  store  a  floating  point  number, 
e  =  ^places  where  the  code  bits  corresponding  to  two  converging  paths  differ.) 
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Eh/No2  =  Eh/No3  =  Eb/No4  (Mol  =  3.0  dB) 


Figure  3:  BER  performance  of  various  algorithms  in  near- 
far  situation. 

BER  curves  of  all  schemes  tend  to  single  user  under  ex¬ 
treme  conditions  of  MAI.  Thus  Fig. 2  depicts  almost  worst 
case  performance  for  user  1  given  its  Eb/NQ.  Note  that 
MBCDFE(1,3)  outperforms  SOVA(12)  under  power  con¬ 
trol  if  the  background  noise  is  not  too  high.  Although 
the  application  of  MBCDFE  requires  whitening,  its  per¬ 
formance/complexity  tradeoff  is  highly  desirable. 

Table  1  compares  the  complexity  of  various  algorithms 
with  binary  symbols  where  it  is  assumed  that  Ay  add  op¬ 
erations  are  required  to  compute  V'(.)  in  (8)  (or  the  equiv¬ 
alent  quantity  for  other  algorithms).  Note  that  the  com¬ 
plexity  of  OSOME(L)  is  on  the  order  of  2^”^(L  —  A”  +  !)• 
SSOME  mainly  consists  of  add-compare-select  operations 
and  exponentiation  is  only  used  for  soft-output  genera¬ 
tion.  The  complexity  of  MBCDFE(J,L)  is  on  the  order 
of  2^  {L  —  J).  The  modified  version  requires  some  extra 
storage  and  computation. 

In  conclusion,  the  subop timal  soft-output  multiuser  esti¬ 
mation  algorithm  with  decision  lag  equal  to  A  —  1  manages 
to  extract  most  of  the  benefits  of  soft-output  demodulation 
with  a  simple  forward  recursion  that  can  be  implemented 


without  whitening.  So  the  algorithm  is  suitable  to  use  with 
long  spreading  codes  and  time-varying  channel  conditions 
like  fast  fading.  Adaptive  complexity  reduction  techniques 
like  the  T-algorithm  and  T-algorithm  with  soft-limiting, 
can  be  applied  to  all  schemes  discussed  in  this  paper. 
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stores  the  quantities  and  p(6n~t  =  for 

J  <  i  <  L  and  b  e  {wi,  W2,  *  •  -  Note  that  jSn  is  a 

truncated  state  comprising  of  J  most  recent  bits. 


(22) 


where 


p(/?„,6„_jr,z„|2:" 

=  p(z„|z"-S6„,/?„_i)p(/?„_i|z"-i)p(6„) 


P(6n-jk”,/?n) 


p(/?n,6n-j,Zn|z" 

Ylbn-j  P(^"  >  ^n—J  1  1^” 


(23) 

(24) 


p(6„_f  =  6Iz”,;3„) 

=  P(&n-t  =  6k",/?»,&n-/)p(*»n-jk”>/^n) 

in- J 

=  Y1  P(^«--  =  tk”'S/^»-l)P(«»n-/k”,/?n),  (25) 

in- J 

for  J  +  1  <  i  <  L 


The  second  equality  follows  from  the  assumption  that 
given  and  /3n-ii  ^n-i  («^  + 1  ^  is  conditionally 

independent  of  Zn  and  (Not  true  unless  J  =  K  —  1), 

To  compute  p(2rn|z”“^,  bnyPn^i)  in  (23),  suppose  we  had 
made  hard  decisions  on  bits  b^^i^  J  +  1  <  i  <  K  —  1 
conditioned  on  the  previous  state  /?n-i  as  follows: 


6(n-*|z"  \y0„_i)  =  argmaxj,^_.p(6„_i|2:"  ^^n-l) 


Assuming  that  these  decisions  are  correct,  we  get 
p(z„|z’‘“S6„,/?„_i)  = 

P  (zn  \b„,0n-i,Hn  -  i\0n-l,z’^-%  J+lKiKK-l) 

which  is  a  known  Gaussian  pdf.  Thus  the  modified 
BCDFE(J,L)  algorithm  consists  of  these  steps:  For  all 
states  Pny  compute  T>(^n\z^),  p(tn-jk”,/?n)  for  all  values 

of  bn-j,  and  p(6„_,-  =  6jz”,  /?„)  for  6  €  {«i,  U2 . wm-i} 

and  J  +  1  <  i  <  L,  using  (22),  (24)  and  (25)  respec¬ 
tively.  Given  each  state  I3n,  make  hard  decisions  on  bits 
bn-i,  J  <  i  <  K  —  2  (these  decisions  will  be  used  to  cal¬ 
culate  p(z„+i|z”,6„+i,;0„)  in  the  next  iteration).  Store 
p(A.|z”),  p(6n-i  =  b\z'^,l3n)  {J  <  i  <  L  -  i  and  b  G 
{ui,U2, •••i«M-i})  and  6(n  -  i|z”,/?„)  (J  <  i  <  K  -2). 
At  lag  L,  obtain: 

Pibn-L\zn  = 


The  BCDFE  algorithm  [3]  makes  hard  decisions  on  sym¬ 
bols  bn^i,  J  <  i  <  L—l  conditioned  on  the  state  by  quan¬ 
tizing  p(6„_,*  =  6|2r”,  /?n)«  It  stores  these  hard  decisions  and 
updates  them  recursively.  Due  to  this,  the  symbol  APP’s 
obtained  at  lag  L  are  not  as  accurate  as  in  the  algorithm  we 
described  here  which  updates  and  stores  p(6n-i  =  b\z^^l3n) 
directly  (25)  and  uses  hard  decisions  only  to  truncate  the 
state.  Note  that  MBCDFE(K-1,L)  is  the  same  as  OSA(L). 


Figure  2:  BER  performance  of  various  algorithms. 


3  Simulation  results 

We  simulate  an  asynchronous  CDMA  system  with  four 
users  that  employ  BPSK  modulation  and  rate  memory 

4  convolutional  encoding  over  an  AWGN  channel.  User 
correlations  used  are  given  by: 

1 

H(D)=  H(m)D"' 


m=— 1 

1 

1 

\  +  2D 

-D 

_4_£)-i  7 

-4-D 

3 

l-^2D-^  _4_£>-i 

7 

-2  +  D 

-D-i  3 

-2  + D-i 

7 

Code  symbol  sequences  of  all  users  are  interleaved  by  a 
30  X  30  interleaver.  Fig. 2  shows  the  BER  of  user  1  versus 
the  signal-to-noise  ratio  (Ei/No)  in  perfect  power  control. 
Each  simulation  was  run  for  a  count  of  500  errors.  It  can 
be  seen  that  the  BER  curves  of  OSOME(3),  OSOME(12), 
SSOME(3),  SSOME(12)  and  MBCDFE(2,3)  lie  in  a  band 
of  width  0.2  dB.  OSOME(12)  loses  about  1.2  dB  over  sin¬ 
gle  user.  SSOME(3)  achieves  a  0.4-  0.5  dB  gain  over 
SOVA(12)  in  the  range  3  —  6  dB.  Note  that  SSOME(3)  re¬ 
quires  only  state  metrics  (and  no  path  metrics)  to  be  com¬ 
puted.  We  have  not  plotted  curves  for  SSOMEl  because 
SSOMEl(3)  is  the  same  as  SOVA(3)  and  SSOMEl(12) 
approaches  the  performance  of  SSOME(12).  The  modi¬ 
fied  BCDFE  algorithms  gain  1.0  —  2.0  dB  over  the  original 
BCDFE  because  the  soft  outputs  of  the  later  are  no  better 
than  hard  decisions. 

Fig. 3  shows  the  BER  of  user  1  versus  E^/Nq  of  in¬ 
terfering  users  (with  Eh/ No  of  user  1  fixed  at  3.0  dB). 
10®  data  symbols  were  counted  for  each  simulation.  The 


Note  that  p(o'Ar, r^(t))  =  Cf(<TN,y^)  and  p(aN,bN-i  = 
b,  r^{t))  =  CfiiaN,  b,  y^).  Thus  for  K  -  1  <  i  <  N  -  1, 


PibN-i 


b\r^it)) 


(13) 


Equations  (11),  (12)  and  (13)  complete  the  recursion  of 
the  OSOME  algorithm.  Note  that  it’s  not  necessary  to  ob¬ 
serve  the  entire  signal  r^(t).  Close  to  optimum  estimates 
can  be  obtained  for  bits  with  L  sufficiently  large,  by 
observing  r”(^)  only.  With  this  in  mind,  we  outline  the 
OSOME(L)  algorithm  as  follows: 

For  each  iteration  n  (n  =  1, 2, . . . ,  AT),  recursively  com¬ 
pute  and  store  the  state  metrics  given  by  (12).  For  each 
n>  z  =  iiT  “•  1,  Af, . .  .,L  and  h  £  {t^i,  •  •  •  j 

also  compute  and  store  the  path  metrics  given  by  (11). 
For  each  n>  L  and  6  E  {ui,  ^2, . . . ,  obtain  symbol 

APP’s  as: 


v{K-l 


b\r-(i))  = 


T,<,„hio-n,b,l/') 


(14) 


M-1 

Pibn-L  -  «M|r”(0)  =  1  -  XI  =  “•■|'*"W) 

*  =  1 

Note  that  if  data  symbols  of  all  users  are  equiprobable  a 
priori,  then  p(6n)  can  be  dropped  from  all  computations. 

2.2  SSOME 

Here  we  reduce  the  recursion  described  in  the  previous 
section  to  add-compare-select  operations  mostly  and  get 
the  SSOME(L)  algorithm.  For  this  we  define: 

m(«T„,j/”)  A  max  [11(1/”,  6")/iVo  +  log p(6")] ,  (15) 

-  Jn— K+1 

mi{<Tn,b,y")  A  rU(y”,l'")/-^o  +  logp(6")] , 

forA:-l<i<n-l.  (16) 

Then  a  derivation  similar  to  (11)  can  be  used  to  show  that 

mi{tTn,b,j/*)  =  m^  [TO,((T„_i,6,y”“^) 

+V(j/„,<r„-i,6„)] +logp(6„)  (17) 

and 

«i(o'n,y")=  max  [m(<T„_i, j/"“^)  +  V(j/„,<r„_i,6„)] 

+  logp(6n)  (18) 

Note  that  recursive  application  of  (18)  for  each  state 
<Tn  while  keeping  path  history  of  the  best  sequence 
leading  to  state  cr„  would  yield  maximum  like¬ 
lihood  hard  decisions  for  the  composite  data  sequence  6^. 
Let  =  6i, . . . ,  67v-t-i,  tjv-t+i,  •  •  •  j  then  note  that: 

=  Cexp[m(crjv,  2/^)],  and 


P  (u-jv,  b^_f+^  =  b^_f'^^{<rN),  bN-i  =  b,  r^(i)) 
=  C  exp[mi  {(Tn  ,  b,  y^)] . 


The  SSOME(L)  algorithm  consists  of  these  steps:  For 
each  iteration  n  (n  =  1, 2, . . . ,  iV),  recursively  compute  and 
store  the  state  metrics  given  by  (18).  For  each  state 
keep  the  best  path  comprising  of  the  latest  L  —  /\  +  1 
hard  decisions  6((Tn)  leading  to  state  an  as  in  the  Viterbi 
algorithm.  For  each  n  >  K,  recursively  update  and  store 
the  path  metrics  given  by  (17)  for  i  =z  K  —  1,  A", .  •  •  j  L  and 
b  ^  bn-i{an)  (Note  that  mj((7n,  6n-*(<Tn),  y”)  -  K^n,y”), 
so  there  is  no  need  to  store  it  separately).  For  each  n  >  L 
and  b  E  {i«i,  «2,  •  •  m  approximate  symbol  APP’s  as: 


p(bn-L  =  6k”(0)  = 


E),^^exp[mx(o-n,6,y")] 

E<7„,4exp[m£((T„,6,y'>)] 


(19) 


The  above  recursion  is  similar  to  that  of  SSA  [1],  except 
the  latter  estimates  symbol  APP’s  by: 


p{bn-L  =  t|r”(i))  = 


exp[mL«,6,y")] 

E;jexp[mL((r;;,6,y")] 


where  tr*  =  argrnax^^^^  yny  We  will  call  this  scheme 
SSOMEl(L). 


2.3  Modified  BCDFE 

In  this  section,  we  modify  the  BCDFE  algorithm  of  [3]  and 
apply  it  to  the  multiuser  channel.  The  sampled  matched 
filter  output  given  by  (3)  can  also  be  expressed  as  [6] 

i+K^l 

jzzi^K+1 

where  the  Gaussian  noise  samples  rii  are  correlated  with 
E[ninj]  =  ^i7/c(O,«(j)(?7(0  -  ^(i))-  A  noise  whitening  fil¬ 
ter  can  be  used  [7]  to  make  the  statistic  uncorrelated  (while 
maintaining  its  sufficiency  for  estimating  the  transmitted 
data  sequence),  to  get 

i-K^l 

j-i 

where  {zi}^^^  is  the  whitened  statistic,  Fij  are  stable  coef¬ 
ficients  obtained  from  the  factorization  of  matrix  G  [7]  and 
A[n^n'']  =  Hereafter  we  assume  that  Fij  can  be  per¬ 
fectly  estimated  prior  to  demodulation.  Note  that  given 
bnfbn^iy . . .  )bn^K+ii  {zi}f^i  IS  a  Sequence  of  i.i.d.  Gaus¬ 
sian  random  variables  with  mean 
and  variance 

Consider  the  receiver  of  Fig.l  with  the  “matched  filter 
bank”  (giving  out  t/j)  replaced  by  a  “whitened  matched 
filter”  (giving  out  z,-).  Let  z”  =  ^2,  •  •  m  ^n,  /?n  = 

fcnj  ^n-i)  •  •  * )  bn-j+1  where  1  <  J  <  AT  —  1  and  A  >  A  —  1. 
The  modified  BCDFE  algorithm  recursively  computes  and 


Figure  1:  Receiver  for  an  asynchronous  CDMA  system. 


2  System  Model 

Consider  a  Gaussian  CDMA  channel  shared  by  K  users 
who  transmit  (after  coding  and  interleaving)  independent 
data  streams  by  modulating  assigned  signal  waveforms. 
The  received  signal  r{t)  is  the  sum  of  K  delayed  signals 
and  an  additive  WGN  process  n{t)  with  power  spectral 
density  No/2 


where  C  is  a  positive  scalar  independent  of  and 
\J(y^  j  b^)  =:  '^i-i  bi^yw^/i)  ^yi  —  bi^yw^^ 

=  l  -j ,K{i) 

where  6i  =  0  for  2  <  0  and  ^  >  TV,  and 

ifz  +  i>A 

/CO 

-oo 


iotij  =  1,2,...,K. 


2.1  OSOME 

In  this  section,  we  determine  a  recursion  for  p{bi\r^{t)). 
Let  <rjv  =  bN,bN-i,.--,bN-K+2,  then  (4)  can  be  written 
as 


N'  K 

Kt)  =  y~!  y~l  bk{j)y/msk (t  —  jT  —  Tk)  +  n{t),  t  €  R  (1) 

jz=0  k=l 

where 

Skit)  =  V2ak{t)  cosiu  ct  +  Ok) 

bkU)  is  the  (j  +  l)th  data  bit  of  user  k  (6fc(j)  6 

{«i ,  U2, . . . ,  «m})  a  finite  alphabet).  sa:(^)  is  the  waveform 
transmitted  by  user  k  during  a  T-second  bit  interval,  ajk(t), 
Tk  j  Ok  and  Wk  are  respectively  the  normalized  spreading  se- 
quence,  the  time  delay,  the  carrier  phase  and  the  symbol 
power  of  user  k,  uJc  is  the  carrier  frequency.  Equation(l) 
can  also  be  expressed  as 

N 

In  the  above  equation,  i  =  Krf{i)  +  «(«)  —  1,  «(0  —  1  is 
the  remainder  of  i  mod  K,  r}{i)  is  the  integer  part  of  i/K^ 
and  N  =  (TV'  -I-  1)K  is  the  composite  message  length  of  K 

users.  The  composite  data  sequence  {&«(*) 
be  estimated  upon  observing  the  received  signal  r(t). 

The  receiver  structure  is  shown  in  Fig.l.  We  assume 
that  the  receiver  is  able  to  estimate  perfectly  the  signal 
powers,  the  time  delays  and  the  phase  lags  of  all  users  prior 
to  demodulation.  Also  assume  without  loss  of  generality, 
that  the  time  delays  of  users  are  in  increasing  order.  The 
matched  filter  output  of  user  /c(z)  at  time  r){i)  is 

yK(i)(»7(0)  =  I-oo  ««(•) 

*•=1,2,. ...AT  (3) 

For  notational  brevity  let  b^  =  {bi}^  =  {^K(«)(’?(®))}fci> 

=  {yi}f  =  {y«(O(^(0)}!Ii  and  r^(t)  =  {r(t), 
t  G  [rB(i),7;(iV)T  +  T,c(jv)]}-  It  has  been  shown  [5]  that 

p(r"(t)|6^)  =  Cexp[U(y^,6^)/Aro] 


U(j/^,  b^)  =  \]iy^-\b^-^)  +  ViyN,<rN-i,  bN)  (7) 
where 

V(yAr, <TN-i,bN)  =  bffy/w^(N)  [2j/7V  -  ^atv'u’kCjv) 

For  n  =  1, 2, . . . ,  AT,  define: 

/((r„,y”)  A  exp[U(y",6”)/IVo]p(6”),  (9) 

Jn-K  +  1 

/.•(tr„,6,y")  A  exp[U(y",6")/IVo]p(6"). 

for  a:  -  1  <  *•  <  n  ~  1.  (10) 

Now 

fi{<^n,b,^) 

=  ^  exp[V(j/„,(T„_i,6„)] 

.exp[U(y"-^6"-l)/A^o]p(6") 

=  exp[V(y„,(T„_i,6„)]p(6„) 

.  Y  exp[U(y"-’,6"-^)/Aro]p(6"-') 

=  Y  exp[V(y„,<r„_i,6„)]p(6„) 

&n-R:+l,^n-*  =  6 

(11) 

Where  the  first  equality  follows  from  (7)  and  the  second 
one  follows  from  the  assumption  that  {bi}^-i  are  indepen¬ 
dent  and  the  fact  that  V(yn5  cTn-i?  ^n)  does  not  depend  on 
jn-ir  From  (11),  it  follows  that 

/(o-n,y")=  Y/  exp[V(y„,(7„_i,6„)]p(6„)/((T„_i,y"-^) 

6n-K  +  l 

(12) 


not  WBE  signal  sets,  they  do  not  enjoy  the  properties  described  in  Proposition 
19  except  in  an  average  sense  -  a  situation  that  is  similar  to  that  discussed  in 
Proposition  15.  For  any  given  z,  there  are  i^Z])  subsets  of  size  L  that  include 
Any  other  belongs  to  {^ZD  of  subsets.  Hence,  (23)  and  (24) 
imply  that  the  following  result  holds: 

Proposition  20.  Fov  any  signal  x^^'^  in  a  WBE  signal  set  X  satisfying  the  hy¬ 
potheses  of  Proposition  19,  and  any  L,  2  <  I  <  K,  there  exists  ay  CX  of  size 
L  such  that  G  3^  und 

Ri<2(L-l)N^-^^{2N^-2N) 

n  <  2{L  -  l)N^  -  {N^  +  N-  2) 

The  right  sides  of  the  above  inequalities  are  just  (L  -  1)/(K  —  1)  times  the 
right  sides  of  (23)  and  (24).  In  fact,  these  quantities  are  respectively  the  average 
values  of  Ri  and  the  bound  on  r,-  over  all  subsets  of  X  that  contain  a;(’)  and 
L  -  1  other  signals.  Note  that  the  values  obtained  are  smaller  than  2{L  -  1)7V^ 
but  the  improvement  over  random  sequences  is  not  as  much  as  with  the  full 
WBE  signal  set.  Unfortunately,  although  the  j-th  signal  is  guaranteed  to  achieve 
a  good  SNR,  the  other  signals  in  the  set  do  not  necessarily  also  achieve  a  good 
SNR.  In  contrast,  for  a  WBE  signal  set,  all  the  K  signals  achieve  values  of  Ri 
smaller  than  2{K  -  1)A^^.  Note  that  Sun  and  Leib  [28]  have  proved  a  slightly 
different  version  of  Proposition  20.  Their  result  is  that,  averaged  over  all  subsets 
3^  C  .4“  of  size  L,  the  value  of  f{y)  is  bounded  by  (L  -  l)/iK  -  1)  times  the 
right  side  of  (24),  just  as  in  Proposition  20  above.  It  is  easily  shown  that  the 
Sun-Leib  result  also  holds  for  the  class  of  WBE  signals  described  in  Proposition 
19. 

5  Concluding  Remarks 

Signal  sets  that  meet  the  Welch  bounds  on  the  maximum  magnitude  of  the  in¬ 
ner  product  or  the  correlation  functions  are  not  easy  to  find.  In  contrast,  finding 
signal  sets  that  meet  the  Welch  bounds  on  the  RMS  magnitudes  is  almost  triv¬ 
ially  eeisy  -  almost  all  linear  codes  (and  their  cosets)  meet  the  latter  bounds, 
with  rates  and  minimum  distances  hardly  being  of  any  concern.  Thus,  meeting 
the  Welch  bound  on  the  RMS  magnitudes  is  far  too  broad  a  criterion  for  signal 
design,  and  the  traditional  measures  for  signal  performance  should  not  be  dis¬ 
carded.  The  recent  interest  in  proving  various  information-theoretic  results  via 
the  use  of  WBE  signal  sets  is  also  somewhat  puzzling.  It  would  seem  that  al¬ 
most  any  result  which  holds  only  when  the  signals  form  a  WBE  signal  set  could 
have  been  proved  long  ago  since  (at  least  in  terms  of  mean-square  Ramming 
weights)  it  has  been  known  for  nearly  forty  years  that  most  binary  linear  codes 
are  WBE  signal  sets!  Finally,  it  is  worth  re-emphasizing  that  subsets  of  WBE 
signal  sets  usually  are  not  WBE  signal  sets,  and  thus,  careful  analysis  is  needed 
to  determine  the  performance  of  systems  that  use  WBE  signal  sets. 
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Abstruct —  A  linear  receiver  for  direct-sequence  spread- 
spectrum  multiple-access  communication  systems  under  the 
aperiodic  random  sequence  model  is  considered.  The  receiver 
consists  of  the  conventional  matched  filter  followed  by  a 
tapped  delay  line  with  the  provision  of  incorporating  the  use 
of  antenna  arrays.  It  has  the  ability  of  suppressing  multiple- 
access  interference  (MAI)  and  narrowband  interference  in 
some  wdghted  proportions,  as  well  as  combining  multipath 
components  without  explicit  estimation  of  any  channel  conditions. 
Under  some  specific  simplified  diannel  models,  the  receiver 
reduces  to  the  mimmnm  variance  distortionless  response 
beamformer,  the  RAKE  receiver,  a  notch  filter,  or  an  MAI 
suppressor.  The  interference  rejection  capability  is  made  possible 
through  a  suitable  choice  of  weights  in  the  topped  delay  line. 
The  optimal  weights  can  be  obtained  by  straightforward  but 
computationally  complex  eigenanalysis.  In  order  to  reduce  the 
computational  complexify,  a  simple  blind  adaptive  algorithm 
is  also  developed. 

Index  Terms — ^Adaptive  s^pal  detection,  antenna  arrays,  code¬ 
division  multiaccess,  interference  suppre^on,  pseudonoise-coded 
communication. 

L  Introduction 

Many  researchers  have  investigated  receiving  structures 
to  suppress  interference,  especially  multiple-access 
interference  (MAI),  in  direct-sequence  spread-spectrum 
multiple-access  (DS/SSMA)  systems.  Verdii  [1]  worked  out 
die  optimum  multiuser  detector,  which  is  near-far  resistant, 
for  DS/SSMA  systems  over  additive  white  Gaussian  noise 
(AWGN)  chaimels.  Lupas  and  Verdii  [2]  considered  simpler 
linear  receivers  to  reject  MAI.  Roughly  speaking,  the  main 
woiidng  principle  of  these  receivers  is  to  project  the  signature 
sequence  of  die  desired  user  onto  the  space  orthogonal  to  the 
signature  sequences  of  the  interferers.  Other  MAI  rejection 
receivers  bas^  on  the  minimum  mean-squared  error  (MMSE) 
criterion  are  also  developed  in  [3]-[7]  (see  [13]  for  a  survey  of 
the  literature).  They  are  adaptive  in  nature  by  making  use  of  the 
cyclostationarity  of  the  received  signal.  Recently,  Honig  et  al. 
[8]  proposed  a  blind  adaptive  linear  receiver  which  eliminated 
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the  need  for  training  sequences  as  in  [3]-[7].  Another  way  to 
suppress  MAI  is  to  explore  the  spatial  relationship  between  a 
user  and  the  interferers  and  to  utilize  the  steering  capability 
of  an  antenna  array.  Kohno,  et  al  [9],  [10],  and  Naguib,  etal. 
[11],  [12]  worked  in  this  direction.  Kohno  [9]  also  suggested 
a  combination  of  spatial  processing  techniques  and  some  of 
(he  MAI  rejection  structures  described  above. 

In  most  of  these  papers,  the  MAI  suppression  schemes 
are  developed  under  the  periodic  sequence  assumption,  in 
which  the  signature  sequences  employed  to  spectrally  spread 
the  data  sequences  are  periodic  with  period  equal  to  a  data 
symbol  duration.  This  assumption  differs  significantly  from 
the  IS-95  standard  [16]  in  which  signature  sequences  with 
much  longer  periods  are  employed.  Wth  sequences  having 
long  periods,  the  receiver  considered  in  [1]  is  too  complex 
for  practical  implementations.  The  decorrelator  receiver  in 
[2]  would  also  ^ve  to  be  modified  in  every  symbol  period. 
Hence,  its  applicability  is  also  limited.  Moreover,  as  the 
cyclostationarity  assumption  in  [3]-[8]  may  not  hold  any 
more,  the  adaptive  receivers  in  [3H8]  may  not  be  capable 
of  rejecting  MAI  with  the  use  of  long  signature  sequences.  On 
the  other  hand,  the  efficacy  of  the  spatial  processing  techniques 
should  remain  provided  the  interferers  and  the  desired  user  are 
not  clustered  spatially.  The  challenge,  however,  is  to  develop 
a  receiving  structure  which  can  effectively  employ  spatial 
processing  techniques  to  reject  MAI  with  the  use  of  long 
signature  sequences. 

To  facilitate  such  a  development,  we  consider  in  this  paper 
DS/SSMA  systems  with  aperiodic  signature  sequences.  As 
the  sequences  are  aperiodic,  it  is  reasonable  to  model  each 
element  of  tiiem  to  be  an  independent  random  variable.  Hence, 
we  adopt  the.  aperiodic  random  sequence  model  which  pro¬ 
vides  a  good  approximation  to  the  long  signature  sequences 
employed  in  practical  systems.  We  consider  a  decentralized 
linear  receiver  composed  of  the  conventional  matched  filter 
(despreader)  followed  by  a  tapped  delay  line  (TDL)  with  the 
provision  of  incorporating  the  use  of  antenna  arrays.  We  point 
out  that  the  observables  in  the  proposed  receiving  structure 
are  the  output  samples  (possibly  faster  than  the  chip  rate) 
from  the  despreader.  This  distinguishes  this  work  from  many 
others  ([4]-[8])  in  which  the  observables  are  outputs  from 
the  chip-matched  filter.  Details  of  the  receiving  structure  and 
the  channel  model  are  given  in  Section  H.  We  note  that 
an  approximate  decorrelating  multiuser  detector,  which  is 
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applicable  to  systems  with  long  signature  sequences,  is  also 
developed  in  [15].  The  major  difference  between  this  work  and 
[15]  is  that  the  receiver  in  [15]  is  centralized,  and  is  developed 
under  the  deterministic  sequence  model. 

The  weights  in  the  TDL  are  chosen  to  maximize  the  signal- 
to-noise  ratio  (SNR),  which  is  defined  in  Section  IV.  Optimal 
choices  under  the  minimum  mean  square  error  (MMSE)  and 
the  constrained  minimum  output  energy  (CMOE)  criteria  are 
also  considered.  In  fact,  we  show  in  Section  IV  that  the  three 
criteria  are  asymptotically  equivalent  when  the  number  of 
chips  per  symbol  N  approaches  infinity.  The  performance 
of  the  receiver  is  examined  for  a  multipath  fading  channel 
with  MAI,  narrowband  interference,  and  AWGN.  It  turns  out 
that  the  proposed  receiver  has  the  ability  of  suppressing  MAI 
and  narrowband  interference  in  some  weighted  proportions,  as 
well  as  combining  multipath  components  simultaneously.  It 
actually  unifies  the  minimum  variance  distortionless  response 
(MVDR)  beamformer,  the  RAKE  receiver,  a  notch  filter,  and 
an  MAI  suppressor.  As  shown  in  Section  V,  it  reduces  to  these 
structures  under  simplified  channel  models. 

In  the  development  of  Sections  FV  and  V,  there  is  a  certain 
parallelism  between  the  MMSE-based  receivers  in  [3]-[8]  and 
the  receiver  proposed  in  this  paper.  Although  this  parallelism 
might  lead  to  similar  properties  and  practical  implications,  it 
cannot  be  obtained  as  a  direct  consequence  from  die  results 
derived  in  [3]~[8]  since  the  structures  of  the  receivers  and 
the  underlying  system  models  are  different  in  our  case.  In 
order  to  establish  the  results  in  Sections  IV  and  V,  we  need 
two  important  asymptotic  properties  of  the  despread  received 
signal  at  output  of  the  matched  filter  which  are  derived  in 
Section  HI.  The  first  property  is  that  the  despread  received 
signal  is  cyclostationary,  and  the  second  one  is  that  the 
correlation  matrix  of  the  desired  signal  component  is  of  rank 
one.  These  two  properties  form  the  basis  of  most  of  the  work 
developed  in  this  paper. 

Sections  VI  and  Vn  address  the  implementation  issues.  In 
Section  VI,  we  see  that  optimal  weights  under  the  maximum 
SNR  criterion  can  be  obtained  by  straightforward  eigenanal- 
ysis.  In  practice,  we  do  not  have  prior  knowledge  of  the 
channel  parameters.  Based  on  the  asymptotic  properties  of 
the  despread  received  signal,  we  develop  a  simple  method 
to  estimate  and  optimize  the  SNR  simultaneously  from  two 
different  observations  of  the  despread  received  signal,  namely, 
a  signal-plus-interference  observation  and  an  interference-only 
observation.  In  [11]  and  [12],  Naguib  et  al  employed  a 
similar  approach  by  using  the  spread  and  despread  received 
signals  to  estimate  the  SNR.  However,  that  approach  does  not 
work  under  the  asynchronous  model  considered  in  this  paper. 
The  eigenanalysis  algorithms  are  computationally  complex 
for  time-varying  channels.  Because  of  the  cyclostationarity 
of  the  despread  received  signal,  it  is  possible  to  construct 
adaptive  algorithms  to  obtain  the  weights  in  the  TDL.  Based 
on  the  CMOE,  we  develop  a  computationally  simple  blind 
adaptive  algorithm  for  the  optimization  to  eliminate  the  need 
for  training  sequences. 

Using  the  blind  adaptive  algorithm  developed  in  Section 
vn,  the  proposed  receiver  requires  the  same  amount  of  infor¬ 
mation  as  the  conventional  matched  filter  receiver,  namely,  the 


signature  sequence  and  the  synchronization  information  (the 
timing  and  the  carrier  phase)  of  the  desired  user.  The  receiver 
is  decentralized  in  nature,  i.e.,  each  receiver  works  indepen¬ 
dently  for  a  single  user.  The  proposed  receiving  structure  is 
useful  in  both  the  base  station  and  the  mobile  units  in  wireless 
DS/SSMA  systems  because  it  is  decentralized  and  can  reject 
interference  with  a  single  algorithm.  Note  that  even  if  only  a 
single  element  antenna  is  used  in  a  mobile  unit  due  to  physical 
constraints,  the  proposed  receiver  still  combats  narrowband 
interference  and  combines  multipath  components. 


n.  System  Model 


In  this  section,  we  describe  the  model  of  the  DS/SSMA 
system.  We  assume  that  there  are  K  simultaneous  users  in  the 
system. 

The  jkth  user,  for  1  <  A;  <  if,  generates  a  data  sequence 


=  (...,  fcWjO 


(fc) 


.  . 


with  a  symbol  duration  of  T  seconds.  We  assume  that  the  data 
symbols  6^*'^  are  random  variables  with  =  1.  Note 

that  this  assumption  is  satisfied,  apart  from  a  trivial  scaling 
factor,  by  almost  all  coded  and  uncoded  data  symbols. 

The  ikth  user  is  provided  a  randomly  generated  signature 
sequence 


7W  = 


(fe)  „(fc) 


The  elements  are  independent  random  variables  such  that 
=  0  and 

^[|af  ^1']  =  =  -Ellaf  ¥]  =  1- 

The  sequence  is  used  to  spectrally  spread  the  data  symbols 
to  form  the  signal 

«(*)=  E 

t=:— 00 

where  the  chip  duration  Tc  is  given  by  Tc  =  T/N ,  iV  is  the 
number  of  chips  per  symbol  interval,  and  ^(t)  is  the  common 
chip  waveform  for  all  signals.  The  chip  waveform  V^(i)  is 
time-limited  to  the  interval  [0,  Tj,  and  normalized  so  that 

Jo 

The  transmitted  signal  for  the  A;th  user,  for  I  <  k  <  Ky 
can  be  expressed  as 

Re  [y/^ak{t  -  Tfc)  exp  (jWc(<  -  Tk))]  (2) 

where  Pk  is  the  power  for  the  A;th  signal,  uJc  is  the  earner 
frequency,  and  Tk  is  the  delay  that  models  the  asynchronous 
system. 

Without  loss  of  generality,  throughout  the  paper  we  will 
consider  the  signal  from  the  first  user  as  the  communicating 
signal  and  the  signals  from  all  other  users  as  interfering  signals. 

We  now  describe  the  channel  model.  We  assume  that  the 
channel  is  a  multipath  fading  channel  corrupted  by  narrowband 
interferers  as  well  as  AWGN  with  two-sided  power  spectral 
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Fig,  1.  Linear  receiver  for  the  jth  data  symbol. 


density  No/2,  We  assume  that  there  are  Kb  narrowband 
interferers.  Each  narrowband  interfering  signal  is  assumed  to 
be  a  pure  tone. 

We  assume  that  the  signals  are  received  by  an  antenna  array 
of  D  elements.  The  signal  vector  received  by  the  antenna  array 
in  complex  baseband  notation  is  given  by 

r{t)  =  y{t)  +  ni{t)  +  nsit)  +  nw{t)  (3) 

where  nwit)  represents  AWGN.  We  assume  that  the  AWGN 
is  also  spatially  white.  The  signal  contribution  y{t)  is  given  by 

y{t)  = 

A=1 

(4) 

The  MAI  contribution  n/(<)  is  given  by 

K  _  Lk 

”/(*)  =  -s/ZPfc  y^gfc,AOfc(t  -  Tfc  -  Tk,x) 
fc=2  A=1 

.  g-J^ciTk+rk.x)  dk,x-  (5) 

The  narrowband  interference  contribution  nsCO  is  given  by 
K+Kb 

nB{t)=  dk.  (6) 

fc=ir-i-i 

In  (4)  and  (5),  Lk  is  the  number  of  propagation  paths  ftom  the 
fcth  transmitter  to  the  antenna  array.  The  parameters  Tk,x,  9k, x, 
and  dk,x  represent  the  delay,  the  complex  gain,  and  the  array 
response  vector  associated  with  the  Ath  path  of  the  signal  from 
the  ifcth  transmitter.  In  (6),  the  parameters  Pk,  9k,  h,  and  dk 
represent  the  power,  the  complex  gain,  the  frequency  deviation 
from  Wc,  and  the  array  response  vector  associated  with  the 
tone  indexed  by  k  for  K-{-l<k<K  +  Kb-  Wthout  loss 
of  generality,  we  assume  that  6k  ^  for  k  ^  1. 

To  facilitate  the  derivation  of  the  receiver,  we  assume  that 
the  communication  channel  is  time-invariant.  In  particular, 
the  number,  the  powers,  the  frequencies  (for  narrowband 
interferers),  the  locations,  and  the  delay  profiles  of  the  users 
and  the  interferers  are  fixed  (but  unknown).  Only  the  data,  the 
signature  sequences,  and  the  AWGN  are  random.  It  turns  out 
that  the  resulting  receiver  is  able  to  adapt  quickly.  Therefore, 
such  an  assumption  does  not  affect  the  usefulness  of  the 
receiver. 

We  assume  that  we  have  achieved  synchronization  with  the 
path  of  the  signal  from  the  desired  user  that  arrives  earliest  at 


the  antenna  array.  Hence,  we  may  assume  Ti  =  0,ri,i  =  0, 
and  ti,a  >  0  for  2  <  A  <  I-i.  We  also  assume  that  ti,a  <  Tmax 
for  all  A  where  Tmax  is  the  maximum  delay  spread  of  the 
channel.  For  coherent  detection,  we  may  assume  the  phase 
angle  of  and  that  of  the  first  element  of  the  array  response 
vector  di,i  to  be  zero. 

The  receiver  is  shown  in  Fig.  1.  To  detect  the  jth  symbol 
the  received  signal  r(<)  is  passed  through  a  matched  filter 
with  impulse  response  hj{t)  where 

t=0 

The  despread  received  signal  at  the  output  of  the  matched 
filter  fj{t)  is  sampled  once  every  T*  seconds  to  give  a  sample 
vector  of  length  D,  where  Tc/Ts  =  S  is  the  number  of 
sample  vectors  per  chip  interval.  Note  that  for  simplicity  in 
notation,  the  matched  filter  in  (7)  is  set  to  be  noncausal.  To 
get  a  causal  matched  filter,  we  can  simply  delay  the  impulse 
response  of  the  filter  described  by  (7).  Consequently,  we  have 
to  delay  the  sampling  by  the  same  amount  of  time.  If  5  is  an 
integer,  we  have  die  equivalent  multirate  implementation  of 
the  matched  filter  as  shown  in  Fig.  2.  Since  ti,a  <  Tmax  for 
all  A,  only  the  first  M  =  [(Tmax  +  Tc)/Tsl  sample  vectors 
in  the  jth  symbol  interval  contain  significant  contributions 
from  the  signal  of  the  desired  user.  The  M  sample  vectors 
fiijT),fj{jT+Ts),  ■  ■  •  ,f,(jT+(M-l)r.)  are  concatenated 
to  give  the  vector  Vj  of  lengfli  MD. 

Finally,  the  decision  statistic  for  the  yth  symbol  interval 
is  formed  by  w^Vj,  where  t»  is  a  weight  vector  whose 
components  remain  to  be  determined.  Notice  that  there  is 
a  complex  scaling  introduced  by  the  weight  vector  in  the 
decision  statistics.  However,  such  a  scaling  is  immaterial  if 
differential  encoding  is  applied. 

In  order  to  facilitate  the  process  of  optimizing  the  weight 
vector,  a  counterpart  of  Vj  that  contains  mainly  noise  Md 
interference  contributions  is  needed.  More  precisely,  we  pick 
an  integer  M'  such  that  M'Tc  >  MTs-  The  M  sample  vectors 

fjUT  +  M'Tc),fj{jT  +  M'Tc  +  Ts), 

-  -  - ,  fj  (jT  +  M'Tc  +  (M  -  1)T,) 

are  taken  and  concatenated  to  give  the  vector  Vj  of  length 
MD.  Note  that  the  timing  difference  between  each  element  of 
Vj  and  the  corresponding  element  of  Vj  is  M'Tc,  an  integral 
multiple  of  the  chip  duration. 
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Fig.  2.  Equivalent  multirate  implementation  of  the  matched  filter. 


In  developing  the  above  sampling  scheme,  we  have  assumed 
that  the  maximum  delay  spread  of  the  channel  Tmax  cannot 
be  larger  than  several  Tc>  which  is  typical  in  most  wireless 
cellular  systems  employing  DS/SSMA.  Otherwise,  the  size  of 
the  sample  vectors  could  be  large.  If  r^ax  is,  indeed,  large, 
other  sampling  schemes  might  be  used  to  reduce  the  size  of 
the  sample  vectors.  For  example,  we  can  estimate  the  arrival 
times  of  the  paths  and  sample  only  at  those  instances. 

ni.  Correlation  Properties 

In  this  section,  we  consider  the  correlation  properties  of 
the  samples  of  despread  received  signal  at  the  output  of  the 
matched  filter.  Wthout  loss  of  generality,  we  consider  the 
samples  associated  with  the  detection  of  the  symbol 

The  despread  signal  at  the  output  of  the  matched  filter  fo(t), 
for  i  >  0,  is  given  by 

ro(0  =  tfW  +  (8) 

where  y{t)^  nj(f),  nait),  and  hw{t)  are  the  matched  filter 
output  due  to  the  desired  signal,  the  MAI,  the  narrowband 
interference,  and  the  thermal  noise  components,  respectively. 
It  is  easy  to  see  that  the  random  processes  y{t),  n/(t), 
and  npv'(t)  are  uncorrelated.  Hence,  the  correlation  matrix 
Rr{t,s)  defined  as  -B[fo(t)f^(5)]  is  given  by 

Rr{t,  s)  -Ry{t,  s)  ^Riij  (t,  s)^Rn^  (t,  $) +Rhw  «)  (9) 

where 


1  oo 


N 

t=— oo 
i#0 


•  ■0(f  -  iTc  -  -  iTc  -  Ti,„) 

i=-iv+l 

tT^O 


•  *0  (s  —  iTc 


=£7[nj(<)nf  (5)] 


fc=2  A=1  v=l 


4fc,A4"i/*  ^  ^it~iTc-Tk-Tk,x) 


•  ^*(s  -  •"  Tfc  -  Tk,u) 

RAgit,s)=E[hB{t)n^{f)] 


=  E  E 

k=K+l  l=K+l 


^,..^..(r.sm{NUSk-6t)/2) 


Ri{t,s)=E[mris)] 

=  2PiT2  53  53  di,x 

A=1  *'=1 


RaAtis)  =£;[nw^(£)a^(s)] 

=  NoT^>{t-8)lDxD-  (13) 
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Ej;(0,0)  Rr{0,Ts)  ■■■  i2f (0, (M  -  1)T,) 

Rf{Ts,0)  Ri.{Ts,Ts)  ; 


LRf((M-l)T„0) 


(20) 


In  (10)-(13),  the  function  ‘tp(-)  is  the  autocorrelation  of  the 
chip  waveform  defined  by 

1 

i){t)  =  —  f  ‘il}{s)i)*{s-t)ds  (14) 

■^c  J—OO 

and  the  function  ^'(•)  is  the  Fourier  transform  of  defined 
by 

^{S)  =  ^  f  ‘  ds.  (15) 

Jo 

Notice  that  for  fixed  t  and  s,  the  sums  involving  in  (10) 
and  (11)  contain  only  finite  numbers  of  nonzero  terms  because 
^(•)  is  time-limited  to  [— Tcj^c]* 

We  observe  two  important  asymptotic  properties  of  the 
correlation  matrix  Rr{t,s)  which  will  be  needed  in  later 
sections.  First,  from  (10)^ 

lim  Ry{t,  s)  =  z{t)z{s)^  (16) 

JV— +00 


where 

Li 

i{t)  =  -  Ti,x)di,x  (17) 

A=1 

i.e.,  when  the  number  of  chips  per  symbol  is  large,  we  can 
neglect  the  effect  of  interchip  interference  and  approximate 
the  correlation  matrix  of  the  desired  signal  component  as  an 
outer  product  of  two  vectors. 

Secondly,  we  see  from  (11)  that  n/(f)  is  cyclostationaiy 
with  period  Tc,  that  is,  for  any  integer  i 

Rnj  (f,  s)  =  Rhi  {t  -h  iTc,  s  4-  iTc)-  (18) 

Moreover,  it  is  obvious  from  (13)  that  nw{t)  is  wide-sense¬ 
stationary.  We  also  notice  from  (12)  that  nsit)  is  asymp¬ 
totically  cyclostationary  with  period  Tc  when  N  approaches 
infinity  because  of  the  fact  that 

smiNTc(6k-6i)/2)  1,  if  5^  =  5^  (mod27r/Tc) 
N^oo  Nsm{Tc{6k-Si)/2)  \0,  otherwise. 

(19) 

Combining  all  these,  we  see  that  the  overall  noise  and  inter¬ 
ference  component  in  fo{t)  is  asymptotically  cyclostationaiy 
with  period  Tc  when  N  approaches  infinity. 

*  The  limiting  argument  here  can  be  interpreted  as  the  convergence  in  matrix 
2-norm.  Recall  that  the  2-norm  of  a  matrix  A  is  defined  [18,  Ch.  2]  as 
||A||  =  sup^^o  (ll*^ll/ll*ll)»  where  the  operator  ||  •  ||  denotes  both  tte 
2-norm  of  a  matrix  and  the  l^clidean  norm  of  a  vector.  Throughout  this 
paper,  all  limiting  arguments  concerning  matrices  and  vectors  are  interpreted 
as  convergences  in  2-norm  for  matrices  and  in  Euclidean  norm  for  vectors, 
respectively. 


With  Rr(t,s),  we  find  that  the  correlation  matrix 
^  E[vov^] 

of  the  sample  vector  vq  described  in  Section  II  is  given  by 
(20)  at  the  top  of  this  page. 

Inheriting  its  properties  from  Rr(t,s),Rv  can  also  be  de¬ 
composed  into  two  parts,  namely, 

(21) 

The  first  part,  is  due  to  the  desired  signal  contribution 
in  Vq.  Just  as  Ry{t,s),  Rz  is  also  asymptotically  equal  to  an 
outer  product  of  two  vectors  when  N  tends  to  infinity.  More 
precisely, 

lim  Rz  =  zz^  (22) 

iV~+oo 


where 


m  1 

~z(.Ts) 


(23) 


The  second  part,  Rn,  is  due  to  the  overall  noise  and  interfer¬ 
ence  contribution  in  t>o,  which  is  asymptotically  cyclostation¬ 
ary  with  period  Tc- 

Similarly,  the  correlation  matrix  Rf,  =  E[vqVq  ]  of  the 
sample  vector  tioj  has  the  same  form  as  Rv  except  that 
Rf{iTs,jTi,)  is  replaced  by 


RriM'Tc  +  iTs,M'Tc  +  jT^),  for  0  <  i,  j  <  M  -  1- 

It  can  also  be  decomposed  into  two  parts.  The  first  part,  which 
we  denote  by  Ri,  is  due  to  the  desired  signal  contribution 
in  uo.  Since  M'Tc  >  MT,  >  T„ax  +  Tc,  (10)  shows  that 
limjv-ooiJ*  =  0.  This  is  the  reason  why  we  say  that  vq 
contains  mainly  noise  and  interference  contributions.  The 
second  part,  which  we  denote  by  Ra,  is  due  to  the  overall 
noise  and  interference  contributions  in  Uq.  From  the  asymptotic 
cyclostationarity  of  the  overall  interference,  R^  =  Rn  when 
N  tends  to  infinity.  Consequently,  we  have 

Um  {R„  -  Ri,)  =  zz^ .  (24) 

iV-~*-oo 

Equations  (22)  and  (24)  will  be  employed  repeatedly  in  all  the 
following  sections. 
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IV.  Optimal  Weight  Vector 

In  this  section,  we  consider  optimal  choices  of  the  weight 
vector  w  according  to  three  criteria,  namely,  the  maximum 
signal-to-noise  ratio  (MSNR),  the  minimum  mean-squared 
error  (MMSE),  and  the  constrained  minimum  output  energy 
(CMOE)  criteria.  MSNR,  which  is  closely  related  to  the 
minimization  of  error  probability,  is  the  main  criterion  while 
MMSE  and  CMOE  are  useful  in  adaptive  algorithm  develop¬ 
ment.  As  shown  in  Proposition  1,  the  three  criteria  are  actually 
asymptotically  equivalent.  Notice  that  the  equivalence  of  three 
similar  criteria  has  been  established  in  [7]  and  [8].  However, 
the  results  are  based  on  the  deterministic  periodic  sequence 
model,  which  is  quite  different  from  the  aperiodic  random 
sequence  model  used  in  this  paper. 


A.  Maximum  Signal-to-Noise  Ratio  (MSNR) 

We  first  consider  the  weight  vector  that  maximizes  SNR 
defined  by,  for  w  ^  0 


SNR(iit;)  = 


w^RzW 

W^RnW^ 


(25) 


Note  that  the  existence  of  the  AWGN  component  guarantees 
that  the  matrix-lin  is  positive  definite.  Hence,  the  SNR  defined 
in  (25)  is  always  finite  and  the  matrix  pencil  (Rz^Rn)  is 
regular  [17,  Ch,  X].  Moreover,  since  both  Rz  and  Rn  dxc 
Hermitian,  all  of  the  generalized  eigenvalues  associated  with 
the  matrix  pencil  are  real  [17,  Ch.  X].  It  can  be  shown  [17, 
Ch.  X]  that  the  weight  vector  that  maximizes  the  SNR  is  the 
generalized  eigenvector  associated  with  the  largest  generalized 
eigenvalue  of  the  matrix  pencil  (Rz^Rn)-  Since,  for  tu  ^  0 


W^RyW 

W^RnW 


=  SNR(tu)  +  l 


(26) 


the  optimal  weight  vector  is  also  the  generalized  eigenvector 
associated  with  the  largest  generalized  eigenvalue  of  the  matrix 
pencil  {Rv,  Rn)-  Since  Ry  closely  approximates  Rn,  it  is  con¬ 
ceivable  that  the  optimal  weight  vector  may  be  approximated 
by  the  generalized  eigenvector  associated  with  the  largest 
generalized  eigenvalue  of  the  matrix  pencil  (.R^,  R^)  when  N 
is  laige.  This  approximation  will  be  justified  in  Proposition  1. 


B.  Minimum  Mean-Squared  Error  (MMSE) 

We  may  also  find  the  weight  vector  that  minimizes  the 
mean-squared  error,  defined  by 

MSB  (to)  =  -  to^t;o|2].  (27) 

It  is  well  known  [20]  that  the  MSE  is  minimized  by 

tUiv  =  SLTg  min  [MSE  (w)]  =  Ry^z  (28) 

for  a  particular  N, 


C  Constrained  Minimum  Output  Energy  (CMOE) 

As  our  objective  is  to  suppress  the  interference  component 
in  the  received  sample  vector,  we  can  choose  w  so  that  the 
output  energy  is  minimized  with  the  constraint  that  the  desired 
signal  component  is  fixed,  which  is  exactly  the  idea  of  the 


MVDR  beamformer.  More  precisely,  we  define  the  output 
energy  as 

a{w)  =  w^RyV)  (29) 

and  choose  «;  to  be  Aat  where 

WisT  =  arg  min  a{w)  (30) 

for  some  constant  c  9^  0  and  a  particular  N.  By  the  method 
of  Lagrange  multipliers,  it  can  be  shown  [20]  that  the  MMSE 
solution  in  (28)  is  also  the  solution  of  (30)  when  c  =  z^R~^z. 
Moreover,  the  solution  of  (30)  for  any  c  0  is  a  scalar 
multiple  of  the  MMSE  solution. 

We  note  that  the  MMSE  and  CMOE  criteria  are  basically 
equivalent  since  they  share  a  conunon  solution.  They  are 
asymptotically  equivalent  to  the  MSNR  criterion  according 
to  the  following  proposition  whose  proof  can  be  found  in 
Appendix  I. 

Proposition  1:  Let  Wn  be  an  eigenvector  associated  with 
the  laigest  generalized  eigenvalue  of  the  matrix  pencil 
{R„,  Ri)  and  wn  =  R^^z  for  a  particular  N.  Let  wn  and 
tt;;v  be  defined  as  in  (28)  and  (30),  respectively.  Then,  for  any 
e  >  0,  when  N  is  sufficiently  large 

[SNR  (to)]  —  SNR  (tojv)  <  e 

max  [SNR  (to)]  —  SNR  (tojv)  <  £ 

max  [SNR  (tu)]  —  SNR  (tbiv)  <  ^ 

max  [SNR  (tu)]  -  SNR  (wn)  <  e. 

V.  OPTIMALrnES  OF  THE  RECEIVER 

In  this  section,  we  consider  the  behavior  of  the  proposed  re¬ 
ceiver  under  different  situations.  Under  appropriate  conditions, 
the  receiver  reduces  to  the  MVDR  beamformer,  a  notch  filter 
for  suppressing  narrowband  interference,  the  RAKE  receiver, 
or  an  MAI  suppressor.  Notice  that  some  of  these  properties 
have  been  developed  independently  for  other  receivers  pro¬ 
posed  in  [3]-[8]  under  the  periodic  sequence  model.  Under 
the  aperiodic  random  sequence  model,  we  are  able  to  unify 
these  properties  into  a  single  linear  receiving  structure. 

A.  MVDR  Beamformer 

To  illustrate  the  beamforming  capability  of  the  proposed 
receiver,  we  consider  an  AWGN  channel  with  MAI  only. 
Hence,  nsit)  =  0,Ljfe  =  1,  and  =  exp(j^fc)  for 
1  <  A;  <  if,  where  Ok  is  the  phase  shift  for  the  kth  user  signal. 
Since  we  assume  synchronization  with  the  signal  from  the  first 
user,  we  have  6i  =  0.  The  absence  of  multipath  fading  also 
implies  that  r^ax  =  0. 

Suppose  that  the  output  of  the  matched  filter  fo(f)  is 
sampled  once  every  bit  interval  at  f  =  jT,  Then  M  =  1 
and  Vo  =  f(0).  Hence,  from  (10)-(13),  for  any  N, 
Rz  =  RyM  =  zz^,  faidRn  =  Rhj{0,0)  +  RawM, 
where  z  =  y/2PiTgi  4i,i.  Notice  that  the  vector  z  is  just  the 
array  response  vector  of  the  desired  user.  Maximizing 
the  SNR  defined  in  (25)  is  equivalent  to  minimizing  the 
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Fig.  3.  Directional  magnitude  response  of  the  beamformer  example. 


noise  variance  w^RnW  subject  to  the  constraint  that  the 
signal  strength  is  kept  constant.  This  is  exactly  the 

operation  performed  by  the  MVDR  beamfonner  [20]. 

As  an  example,^  we  consider  a  two-user  system.  The 
received  power  of  the  interferer  is  20  dB  stronger  than  that  of 
the  desired  user,  i.e.,  P2|5'2p  =  lOOPil^ip.  The  signal  of  the 
desired  user  comes  at  0®  whereas  the  interferer  signal  comes 
at  60°.  Fig.  3  shows  realizations  of  the  normalized  directional 
magnitude  responses  of  the  optimal  receiver  output  for  a  five- 
element  array  and  a  nine-element  array.  The  spatial  selectivity 
of  the  receiver  is  shown  by  the  deep  nulls  at  60° . 

Next,  we  show  the  performance  of  the  receiver  when  the 
number  of  interferers  increases.  The  received  power  of  each 
interferer  is  20  dB  stronger  than  that  of  the  desired  user.  The 
signal  of  the  desired  user  comes  at  0°  whereas  each  interferer 
signal  comes  at  an  independent  and  uniformly  distributed 
random  angle.  The  average^  maximum  SNR  achievable  by 
the  receiver  using  anterma  with  different  numbers  of  elements 
is  plotted  in  Fig.  4.  Obviously,  we  see  the  advantage  of  using 
an  antenna  array  with  more  elements  than  a  single  antenna. 

B,  Notch  Filter  for  Suppressing  Narrowband  Interference 

To  illustrate  the  notch  filtering  capability  of  the  proposed 
receiver,  we  consider  an  AWGN  channel  with  the  presence 

^For  this  and  all  other  examples  in  this  paper,  we  assume  binary  data 
sequences  and  binary  signature  sequences  widi  the  use  of  a  rectangular 
chip  waveform.  The  bits  in  the  data  sequences  are  equally  probable  and 
independent.  Moreover,  we  always  assume  AT  =  127  and  the  existence  of 
AWGN  with  one-sided  power  spectral  density  No  such  that  the  signal-to- 
thermal-noise  ratio  (STNR)  is  15  dB. 

^The  word  “average**  here,  and  in  the  following  examples,  means  that  the 
value  refened  to  is  obtained  by  averaging  over  500  realizations. 


of  a  single  user  and  a  single-tone  narrowband  interferer  only, 
i.e.,  a:  =  1,  JiTB  =  1,  and  ni{t)  =  0.  Moreover,  we 

also  assume  that  a  single-element  antenna  is  being  used,  i.e., 
D  =  1.  In  this  way,  the  effect  of  notch  filtering  can  be  singled 
out  from  the  spatial  selectivity  of  a  multielement  antenna  array. 

Suppose  that  the  output  of  the  .  matched  filter  fo(t)  is 
sampled  once  every  Tc  seconds  and  M  samples  are  taken 
to  form  the  vector  vq.  Using  the  results  in  Section  HI,  a 
straightforward  calculation  shows  that  Rz  =  and 

fi^=:C2ef -hiNToriMxM  (31) 


where 

C2  =  y®Tl52||®(«2)|[le"'**^‘ 


and  *  =  y/TPlT[l  0  •  •  -  0]^.  nie  optimal  weight  vector 

is  given  by 


w  =  Rj^z 


1 

NoT 


JVPilPiP  1 

P2|fl2p  C 


62 


(32) 


where 

C7  =  M+2p^y|^2|2|f(52)|V^' 

From  (33),  we  see  that  when  the  narrowband  noise  dominates, 
C  M. 

Taking  a  closer  look  at  the  definition  of  C2.  we  observe 
that  components  of  the  vector  C2  are  just  the  samples  of  the 
single-tone  interference  contribution  nB(f).  From  (32) 


V)^e2 


P2\92?  C-M 
AfPilPiP'  C-1  ■ 


(34) 
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Fig.  4.  Average  maximum  SNR  achievable  in  the  beamformer  example. 


When  M  >  1,  we  can  see  from  (34)  that  the  relative  strength  of 
the  narrowband  noise  component  is  mitigated  by  the  weighting 
operation.  In  particular,  when  the  narrowband  noise  dominates 
over  the  wideband  AWGN,  C7  «  M.  The  ratio  of  the  narrow- 
band  noise  magnitude  to  the  signal  magnitude  in  (34)  is  almost 
zero.  This  means  that  the  weighting  operation  acts  as  a  digital 
filter  to  notch  out  the  single-tone  interference  component,  62, 
from  the  matched  filter  sample  vector  vq- 

As  an  example,  consider  a  single-user,  single-element  an¬ 
tenna  system  with  a  single-tone  interferer.  The  received  power 
of  the  single-tone  interferer  is  20  dB  above  that  of  the  user.  The 
frequency  of  the  interferer  is  exactly  the  carrier  frequency,  i.e., 
62  =  0.  Fig.  5  shows  the  frequency  responses  of  the  optimal 
receiver  for  Af  =  5  and  M  =  9.  The  notch-filtering  nature  of 
the  receiver  is  shown  readily  in  the  frequency  response. 

For  the  case  of  multitone  interference  {Kb  >  1)»  the 
argument  is  basically  the  same.  As  an  example,  we  consider 
the  situation  where  the  received  power  of  each  narrowband 
interferer  is  20  dB  above  that  of  the  user.  The  frequency 
offset  (from  the  carrier  frequency)  of  each  tone  is  uniformly 
distributed  on  [-l/2Tc,  l/2rc).  The  average  maximum 
SNR  achievable  by  the  receiver  is  plotted  against  the 
number  of  multitone  interferes  in  Fig.  6  for  different 
values  of  M. 


C.  RAKE  Receiver 

We  consider  a  single-user  conununication  system  in  a 
multipath  fading  channel  with  AWGN,  i.e.,  iiT  =  1,  n/(0  =  0, 
and  nsit)  =  0.  Again,  we  assume  a  single-clement  antenna 
is  employed  to  eliminate  the  spatial  selective  capability  of  the 
proposed  receiver  structure,  i.e.,  i)  =  1. 


We  begin  with  a  simple  case  in  which  the  path  delays 
for  2  <  A  <  jLi,  are  integral  multiples  of  Tc  and  we  sample  iht 
matched  filter  output  fo{t)  every  Tc  seconds,  i.e.,  Tg  —  Tc-  We 
assume  there  are  altogether  M  paths  which  have  path  delays 
ranging  from  0  to  (M-l)Tc.  For  compatibility  with  the  model 
we  introduced  in  Section  H,  we  can  simply  set  the  gains  of  the 
nonexisting  paths  to  zero.  Using  the  correlation  properties  of 
the  various  components  in  Section  HI,  one  can  obtain  easily 
that  Rn  =  NqTImxM  and 

R^  =  zz^  +  0{l/N)  (35) 


where 


From  Proposition  1,  for  sufficiently  large  TV,  the  optimal 
weight  vector  is  approximately  given  by 


id  =  R„^z  = 


1 

-r-Z. 

NoT 


(36) 


Hence,  the  optimal  weight  vector  is  just  the  vector  z  since  a 
scalar  constant  is  immaterial  in  the  maximization  of  the  SNR. 
Since  the  components  of  z  are  the  path  gains  of  the  lowpass 
equivalent  representation  of  the  paths,  the  optimal  weighting 
operation  is  to  combine  the  energy  in  each  path  coherently. 
This  is  exactly  the  characteristics  of  a  RAKE  receiver  [21]. 

In  general,  it  is  unlikely  that  all  the  path  delays  are  integral 
multiples  of  the  chip  duration.  Therefore,  in  order  to  achieve 
a  finer  resolution,  we  sample  the  matched  filter  output  fo(f) 
at  a  rate  higher  than  the  chip  rate,  i.e.,  Ts<Tc-  Following 
the  development  in  the  simple  case  above,  Rz  still  takes  the 
form  in  (35)  with  the  mth,  m  =  0, 1,  •  •  • ,  Af  —  1,  element  of 
z  given  by  i(mT,)  defined  in  (17).  In  fact,  the  vector  z  is 
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Rg.  5.  Frequency  response  of  the  notch  filter  example. 


Fig.  6.  Average  maximum  SNR  achievable  in  the  notch  filter  example. 


just  the  oversampling  counterpart  of  the  path  gain  vector  in 
the  simple  case.  Unlike  the  simple  case,  the  noise  covariance 
matrix  Rn  is  no  longer  the  identity  matrix.  Nevertheless,  it  is 
easy  to  see  that  iZn  is  a  positive  definite  Hermitian  matrix. 
Hence,  we  have  the  decomposition  Rn  =  where  G  is  the 
square  root  of  Rn-  Note  that  G  is  Hermitian  and  invertible. 


For  sufficiently  large  N,  the  optimal  weight  vector  is  well 
approximated  by 

=  {G-^zf-G-\  (37) 

The  optimal  weighting  operation  described  by  (37)  can  be 
decomposed  into  two  operations  in  cascade.  We  can  treat 
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Fig.  7.  Average  maximum  SNR  achievable  in  the  RAKE  receiver  example. 


the  linear  operation  defined  by  G~^  as  a  noise  whitening 
filter.  Then  the  linear  operation  defined  by  {G  is  just 
a  digital  filter  matched  to  response  of  z  after  the  noise 
whitening  operation.  Notice  that  since  the  noise  samples  are 
not  independent  in  this  case,  a  direct  coherent  combination  of 
the  paths  as  in  the  RAKE  receiver  is  not  optimal.  This  is  why 
the  noise  whitening  filter  is  needed. 

We  use  an  example  to  show  the  advantage  of  sampling 
more  than  once  in  each  chip.  The  single-user,  single-element 
RKitpnna  system  is  used  for  a  multipath  fading  channel.  There 
are  five  paths,  i.e.,  L\  =  5,  and  the  complex  path  gains  are 
independent  zero-mean  complex  Gaussian  random  variables 
with  equal  variance  such  that  the  total  average  power  of  the 
five  paths  is  equal  to  die  original  transmitted  power  Pi .  Except 
the  first  path,  we  assume  each  path  arrives  with  a  uniformly 
distributed  delay  ranging  ftom  [0,  Tmax)  independently,  where 
Tmax  =  5Tc.  The  average  maximum  SNR  achievable  by  the 
receiver  is  plotted  against  the  number  of  samples  in  a  chip,  S, 
in  Fig.  7.  As  we  expect,  the  average  performance  is  better 
when  S  increases.  Note  that  the  vector  size  M  increases 
with  5,  and  hence  performance  improves  at  the  expense  of 
complexity. 

D.  MAI  Suppressor 

For  simplicity,  we  consider  a  communication  system  with 
two  users  on  an  AWGN  channel,  i.e.,  K  =  2.  There  is  no 
nanowband  interference  or  multipath  effect,  i.e.,  nsit)  =  0, 
Life  =  1,  and  gk,i  =  exp  (JOk)  for  A:  =  1, 2.  Again,  we  assume 
a  single-element  antenna  is  being  used,  i.e.,  D  =  1.  Moreover, 
the  matched  filter  ouqiut  f{t)  is  sampled  every  Tc  seconds,  i.e., 
Tf  =  Tc  and  we  use  the  first  M  samples  of  f(t)  to  form  the 
vector  uo- 


Using  the  results  in  Section  m,  we  obtain  that  jR*  is,  again, 
given  by  (35)  and  z  =  V2^r[l  0  -  •  •  0]’’  as  in  the  notch 

filter  case.  On  the  other  hand,  when  the  chips  of  the  interferer 
do  not  align  with  those  of  the  desired  user,  Jtn  is  tri-diagonal 
and  some  form  of  optimal  combining  is  performed.  In  fact, 
the  prewhitening  explanation  in  the  previous  section  can  be 
employed  to  justify  the  optimal  weighting  operation  in  this 
case  with  the  modification  that  only  one  path  is  present  this 
time.  To  illustrate  the  above  argument  by  example,  we  consider 
a  single-element  antenna  system  with  two  users'  The  power  of 
the  interferer  is  20  dB  above  the  desired  user.  Fig.  8  shows  the 
maximum  SNR  achievable  by  the  receiver  when  T2,i  varies 
for  different  values  of  M.  The  maximum  suppression  is  about 
1.5  dB  when  T2,i  =  0.5.  Notice  that  MAI  suppression  is 
achieved  by  exploring  the  asynchrony  between  the  interferers 
and  the  desired  user.  This  result  is  related  to  that  in  [14]. 
The  major  differences  are  that  in  [14],  a  prewhitening  filter 
is  applied  before  the  conventional  matched  filter,  and  the 
delays  of  the  interferers  are  assumed  to  be  random  instead 
of  deterministic. 


E.  Spatially  and  Temporally  Noise-Whitening  Matched  Filter 

The  prewhitening  argument  in  Section  V-C  can  be  gener¬ 
alized  to  the  case  of  general  multiuser,  multielement  antenna 
systems  over  multipath  fading  channels  with  AWGN.  Since  the 
generalization  is  straightforward,  we  state  the  result  without 
going  through  the  derivation.  When  N  is  large,  the  optimal 
weight  vector  is  approximately  given  by  (37)  where,  now, 
the  vector  z  is  the  response  vector,  including  the  spatial 
information,  of  the  matched  filtered  charmel  with  the  desired 
user  signal  as  input  The  noise  component  n  in  the  sample 
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Rg.  8.  Average  maximuni  SNR  achievable  in  the  MAI  suppressor  example. 


vector  t>o  includes  the  MAI,  narrowband  interference,  and  the 
AWGN  components.  The  crucial  point  is  that  a  whitening 
operation  still  exists.  Therefore,  the  weighting  operation  can 
still  be  viewed  as  noise-whitening  discrete-time  matched  filter 
as  in  Section  V-C.  However,  since  the  noise  component  vector 
n  contains  spatial  and  temporal  information,  the  receiver,  in 
this  case,  can  be  interpreted  as  a  spatially  and  temporally  noise¬ 
whitening  matched  filter.  This  point  of  view  is  related  to  that 
in  [10]. 


VI.  Eigenanalysis  Algorithms 

In  this  section,  we  consider  a  class  of  nonadaptive  al¬ 
gorithms  to  obtain  the  optimal  weight  vector.  As  discussed 
in  Section  IV.  the  optimal  weight  vector  is  the  generalized 
eigenvector  of  the  matrix  pencil  {Rz,Rn)-  However,  the  cor¬ 
relation  matrices  Ji*  and  Bn  are  usually  not  known.  In  Section 
IV-A,  we  suggest  a  method  to  circumvent  this  difficulty  by 
employing  the  similarity  in  the  statistical  structures  of  the 
sample  vectors  tio  and  tio-  The  generalized  eigenvector  of  the 
matrix  pencil  iRv,Rv)  is  approximately  the  optimal  weight 
vector  when  the  number  of  chips  per  symbol,  N,  is  large. 
Note  that  this  condition  is  usually  satisfied  in  all  DS/SSMA 
systems.  The  advantage  of  this  formulation  is  that  both  Bo 
and  Bo  can  be  estimated  directly  using  the  samples  taken 
from  the  output  of  the  matched  filter  in  Fig.  1.  Any  consistent 
estimators  of  the  correlation  matrices  Bo  and  B*  can  be  used. 
The  consistency  and  Lemma  2  in  Appendix  I  guarantee  the 
average  SNR  obtained  by  using  the  estimates  of  the  correlation 
matrices  in  the  eigenanalysis  is  close  to  the  maximum  SNR 
when  the  estimates  are  good  enough. 


To  conclude  the  discussion  above,  we  consider  two  ex¬ 
amples  and  use  them  to  demonstrate  the  dependence  of  the 
average  SNR  obtained  by  the  algorithm  stated  above  on  the 
accuracy  of  the  estimates  of  the  correlation  matrices  R„  and 
Bfi.  The  two  examples  are  the  first  example  given  in  Section 
V-A  and  the  example  given  in  Section  V-C  with  the  restriction 
that  the  paths  arrive  exactly  at  the  chip  boundaries  and  we 
sample  once  per  chip,  respectively.  In  both  of  the  examples, 
we  take  M'  =  31  and  use  the  time  average  correlation  matrices 
as  the  estimators  for  the  correlation  matrices,  i.e.. 


Figs.  9  and  10  show  the  average  SNR  obtained  in  the  first 
example  and  the  second  example  for  different  values  of 
J,  respectively.  Also  shown  in  the  figures  are  the  average 
maximum  SNR  achievable  by  the  receiver  and  the  average 
SNR  when  no  weighting  is  performed,  i.e.,  only  the  first 
element  of  the  sample  vector  is  used  as  the  decision  statistic. 
It  is  clear  in  both  cases  that  the  average  SNR  increases  as  J 
increases.  When  J  is  large,  the  estimates  of  the  correlation 
matrices  are  accurate,  and  hence,  the  average  SNR  is  close  to 
the  maximum  SNR.  For  these  two  examples,  we  need  about  20 
symbols  to  get  an  average  SNR  of  1  dB  below  the  mawmum 
value.  For  slowly  time-varying  chaimels,  we  can  modify  the 
estimators  in  (38)  and  (39)  by  applying  a  window  with  a 
carefully  chosen  width. 


number  of  sample  vectors  used  (J) 


Fig.  10.  Average  SNR  obtained  by  the  eigenanalysis  algorithm  in  the  RAKE  receiver  exan:^>le. 


VII.  Adaptive  Algorithms  the  MMSE  and  the  CMOE  criteria  are  asymptotically  equiv- 

The  major  disadvantage  of  the  eigenanalysis  algorithms  to  MSNR.  Similar  to  [7]  and  [8],  the  consideration 

stated  in  Section  VI  is  the  computational  complexity  in-  of  MMSE  and  CMOE  provides  simple  methods  to  develop 

volved  in  obtaining  the  generalized  eigenvector.  It  limits  adaptive  algorithms.  Consideration  of  MMSE  naturally  leads 
the  size  of  the  weight  vector.  As  shown  in  Section  IV,  to  the  well-known  LMS  algorithm  when  training  sequences  are 
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available.  However,  the  requirement  of  training  sequences  is 
undesirable  in  some  DS/SSMA  systems,  especially,  when  the 
communication  channels  are  time-varying.  With  the  CMOE 
criterion,  we  now  develop  a  blind  adaptive  algorithms  to  obtain 
the  optimal  weight  vector. 

From  the  previous  discussion,  we  know  that  direct  estimates 
of  z  from  the  samples  at  the  output  of  the  matched  filter  are  not 
readily  available.  However,  from  (24),  the  correlation  matrices 
and  jRi,  and  hence  the  matrix  zz^  can  be  easily  estimated 
from  the  sample  vectors  Vj  and  Vj .  Therefore,  if  we  have  an 
adaptive  algorithm  which  only  requires  the  knowledge  of  the 
matrices  Ru  and  zz^,  we  can  approximate  this  algorithm  by 
using  estimates  of  R„  and  zz^  instead.  This  is  similar  to  the 
development  of  the  LMS  algorithm  from  the  steepest  descent 
algorithm.  Unlike  the  LMS  algorithm,  the  resulting  algorithm 
would  be  a  blind  adaptive  algorithm  since  no  training  sequence 
is  needed. 

The  asymptotic  equivalence  of  CMOE  and  MSNR  suggests 
that  we  can  deteimine  the  optimal  weight  vector  by  construed 
minimization  of  the  output  energy  (t{w)  when  N  is  sufficiently 
large.  Assume  that  the  matrices  Rv  and  zz^  are  given. 
Consider  the  matrix  P,  defined  by 


P*  = 


zz^ 

Wz 


(40) 


Notice  that  it  defines  a  projection  operation  onto  the  space 
spanned  by  the  vector  z.  Since  z^ z  is  just  the  trace  of  the 
matrix  zz^,  we  can  obtain  P*  from  zz^.  The  gradient  vector 
of  the  function  is  g^(w)  =  R„V3.  Restricted  by  the 
constraint  v)^z  =  c,  vi/e  caimot  descend  in  the  direction 


opposite  to  g^iw)  as  in  die  steepest  descent  algorithm.  Let 
g„{w)  be  the  component  of  which  is  orthogonal  to  the 


vector  z 


g^  =  (I-P,)R„w.  (41) 

Note  that  the  vector  g„  is  determined  directly  by  P*  and  Rv- 
It  is  easy  to  see  that  descending  in  the  direction  opposite  to 
g„{vi)  does  not  violate  the  constraint  w^z  =  c.  Therefore,  we 
set  up  an  adaptive  algorithm  which  descends  in  that  direction 
to  solve  the  constrained  minimization  problem. 

Algorithm  1:  For  j  >  1 

=  [I  -  fJLiJ  -  Pz)Rv]w{j  -  1)  (42) 

where /i>0  and  ti;(0)^a  =  c. 

The  following  proposition,  whose  proof  is  given  in  Appen¬ 
dix  n,  summaries  the  convergence  of  Algorithm  1.  It  solves, 
iteratively,  the  constrained  minimization  problem  whenever  fi 
is  chosen  to  be  adequately  small. 

Proposition  2:  Given  any  ii;(0)  such  that  w{0)^z  =  c.  for 
j  >  1,  update  w{j)  as  in  iMgorithm  1.  Suppose  p  is  chosen 
such  that 

0<M<T^  (43) 

where  A^ax  is  the  largest  eigenvalue  of  the  matrix  {I-Pz)Rv, 
then 


lim  v)[j)  = 

J—fOO 


'  z^w(0) ' 

,z^RZ^z. 


Wjv- 


(44) 


Moreover,  as  j  increases,  (T{w{j))  i 


Unfortunately,  it  is  shown  in  Appendix  II  that  Algorithm  1 
is  numerically  unstable  even  when  p  satisfies  the  condition 
in  Proposition  2.  To  stabilize  the  algorithm,  we  may  put 
some  restrictions  on  the  vector  w{j)  at  each  iteration.  By 
restricting  w{j)  at  each  iteration,  the  constraint  w{j)^z  =  c 
may  not  be  satisfied  at  each  iteration.  Instead  of  considering 
the  minimization  problem  of  the  output  energy,  we  define,  for 
w  such  that  w^z  ^  0, 


ct(ui)  = 


w^RvW 

w^zz^w 


(45) 


and  consider  the  minimization  problem  below 


min  a(v}).  (46) 

Note  that  it  reduces  to  the  minimization  problem  of  the  output 
energy  if  the  additional  constraint  w^z  =  c  is  added. 

Proposition  3  below  shows  that  the  following  numerically 
stable  algorithm  solves  the  constrained  minimization  problem 
of  a{w)  in  (46)  when  p  is  chosen  to  be  adequately  small. 


Algorithm  2: 

For  j  >  1  . 

wU)  =  c(i)[J  -  piJ  -  Pz)RvMj  -  1)  (47) 


where  /x  >  0  and  w{0)^z  ^  0.  At  each  iteration,  the  complex 
constant  c{j)  is  chosen  to  stabilize  the  algorithm  in  the 
following  way. 

Fix  one  m  €  {1,2, •  •  •,MD}.  For  each  j  >  1,  let  Wm{j) 
be  the  with  element  of  If  Wm{j)  ^  0,  choose  c(j)  such 
that  ||ti;0')||  =  1  and  axg  {wmij))  =  0.  Otherwise,  choose 
c{j)  such  that  \\w{j)\\  =  1  and  arg(c(i))  =  0- 
Proposition  3:  Given  any  ti>(0)  such  that  tii(O)  z  ^  0,  for 
j  >  1,  update  v){j)  as  in  Algorithm  2.  Suppose  p  is  chosen 
such  that 

0<p<^  (48) 

^max 


where  Amax  is  the  largest  eigenvalue  of  the  matrix  (I-P z)Rv, 
and  the  mth  element  of  ibn,  w^m  #  0.  where  m  is  the  index 
fixed  in  Algorithm  2.  Then 


lim  w{j)  = 


exp  (-i  arg  (uijvm))  - 


(49) 


Moreover,  as  j  increases,  o{w{j))  i  l/z^R„  z. 

The  proof  of  Proposition  3  and  the  stability  argument  of 
Algorithm  2  can  be  found  in  Appendix  O. 

Note  that  for  Algorithm  2  to  give  the  desired  weight  vector, 
we  have  to  choose  the  initial  vector  ttf(O)  such  that  it  is  not 
orthogonal  to  the  signal  vector  z.  In  practical  situations  with 
synchronization  achieved,  the  first  sample  at  the  output  of  the 
matched  filter  usually  carries  a  significant  portion  of  power. 
In  most  cases,  it  should  not  be  zero.  Hence,  it  is  usually  a  srfe 
choice  to  start  the  algorithms  with  iii(0)  =  [1  0  •  •  •  0]  . 

Moreover,  the  algorithm  also  requires  us  to  fix  the  ph^e  angle 
of  one  of  the  elements  of  the  weight  vector  at  each  iteration. 
By  the  same  line  of  reasoning,  we  can  choose  this  to  be  the 
first  element. 
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Fig.  11.  Average  SNR  obtained  by  the  blind  adaptive  algorithm  in  the  beamformer  example. 


Algorithm  2  requires  knowledge  of  Jit,  and  zz^.  To  obtain 
a  blind  adaptive  algorithm,  we  estimate  Rr,  and  Ry  by  vjvf 
and  Vjvf  at  the  jth  iteration,  respectively.  When  N  is  large, 
it  is  reasonable  to  estimate  zz^  by  vjv^  —  Vjvf.  Replacing 
zz^  and  Ry  in  (47)  by  these  estimates,  we  get  the  following 
blind  adaptive  algorithm. 

Algorithm  3:  For  j  >  I 

w{j)  =  c(i){t£>(i  -  1)  +  ti[VjW{j  -  1)] 

•  [(«f  -  (»f 

=  c{j){I  +  -  Vjvf]Vjvf}w{j  -  1)  (50) 

where  c{j)  and  tn(0)  are  chosen  as  in  Algorithm  2. 

The  first  form  in  (50)  is  for  the  sake  of  implementation, 
while  the  second  form  provides  a  simple  intuitive  interpre¬ 
tation  of  the  algorithm.  Note  that  the  matrix  \{vf  Vj)I  - 
Vji)f]vjvf  represents  the  operation  of  first  projecting  onto  the 
space  spanned  by  Vj  and  then  projecting  Ae  result  onto  the 
space  orthogonal  to  Vj.  Since  Vj  contains  the  signal  and  noise 
components  and  vj  contains  mainly  the  noise  component,  the 
algorithm  can  be  roughly  viewed  as  projecting  the  current 
weight  vector  to  the  direction  of  the  noise  and  signal;  then 
removing  the  noise  part  by  orthogonal  projection;  and,  finally, 
changing  the  current  weight  vector  by  a  small  amount  in  the 
resulting  direction. 

While  an  exact  analysis  of  Algorithm  3  may  be  prohibitively 
difficult,  we  expect  it  to  retain  the  numerical  stability  of 
Algorithm  2.  In  fact,  we  may  view  the  estimation  errors 
of  zz^  and  Ry  as  perturbations  of  the  eigenvalues  in  the 
proof  of  Proposition  3.  Since  Algorithm  2  is  stable  with 
respect  to  perturbations  of  the  eigenvalues,  this  provides  some 


indications  on  the  convergence  of  Algorithm  3.  Therefore, 
it  is  reasonable  to  choose  the  initial  vector  and  /x  to  satisfy 
the  requirement  similar  to  that  in  Proposition  3.  Note  that 
(48)  should  be  modified  to  0</Lfc<2/(Ainax^^^)»  and  the 
randomness  in  this  algorithm  actually  helps  to  loosen  the 
orthogonality  requirement  of  the  initial  vector.  We  also  expect 
behavior  of  Algorithm  3  to  be  similar  to  that  of  the  LMS 
algorithm.  As  a  rule  of  thumb,  we  should  choose  such  that 

2 

0</x<7 - ^ (51) 

(average  energy  m  VjY 

We  reconsider  the  two  examples  in  Section  VI  to  demon¬ 
strate  the  behavior  of  Algorithm  3.  In  both  examples,  we  start 
the  algorithm  by  tt;(0)  =  [10  •  •  -  0]^,  and  the  first  element 
is  chosen  to  be  fixed  in  phase,  /x  is  chosen  to  be  half  of  the 
upper  limit  in  (51).  The  results  of  the  beamformer  example 
aie  given  in  Figs.  11  and  12.  Fig.  11  ^hows  the  average  SNR 
achieved  by  the  weight  vectors  given  by  the  algorithm  at 
each  iteration.  Fig.  12  gives  the  Euclidean  norm  of  the  vector 
difference  between  the  weight  vectors  given  by  the  algorithm 
and  the  ideal  limit  given  in  (49).  Similar  results  of  the  RAKE 
example  are  plotted  in  Figs.  13  and  14. 

Just  like  the  LMS  algorithm,  we  observe  from  Figs.  12  and 
14  that  the  weight  vectors  do  not  converge  to  the  ideal  limits. 
In  fact,  they  approach  the  ideal  limits  closely  and  exhibit  small 
fluctuations  about  the  ideal  limits.  Since  the  weight  vectors  do 
not  converge  to  the  ideal  limits,  the  maximum  SNR  is  not 
reached  as  indicated  in  Figs.  11  and  13.  However,  the  average 
SNR  achieved  after  a  large  number  of  iterations  is  less  than 
a  quarter  of  a  decibel  from  the  maximum  value  in  the  RAKE 
receiver  example,  while  the  corresponding  difference  in  the 
beamformer  example  is  about  1  dB.  In  the  RAKE  receiver 
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Fig.  13.  Average  SNR  obtained  by  the  blind  adaptive  algorithm  in  the  RAKE  receiver  example. 


example,  it  takes  about  15  iterations  (symbols)  to  get  to  an 
average  SNR  of  1  dB  below  the  maximum  value.  That  value 
for  the  beamformer  example  is  about  20  iterations.  Note  that 
the  eigenanalysis  algorithm  in  Section  VI  requires  a  similar 
amount  of  symbols  to  get  to  the  same  average  SNR. 

We  note  (hat  better  estimators  for  the  matrices  zz^  and 
R„  can  also  be  obtained  directly  from  the  matched  filter 


sample  vectors  vj  and  Vj.  However,  these  better  estimators 
lead  to  more  computationally  complex  algorithms.  Therefore, 
we  restrict  our  attention  to  the  simple  estimators  mentioned  in 
this  paper.  Unlike  the  adaptive  algorithms  suggested  in  [3]— [8], 
the  computational  complexity  of  Algorithm  3  is  of  the  order 
of  MD  per  iteration  which  does  not  depend  on  the  spreading 
factor  N. 
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Rg.  14.  Euclidean  norm  of  the  vector  difference  between  the  weight  vector  and  the  ideal  limit  in  the  RAKE  receiver  example. 


Finally,  we  point  out  that  Algorithms  1-3  are  closely  related 
to  the  algorithm  developed  in  [8]  which  is  also  based  on  the 
method  of  constrained  stochastic  gradient  descent.  The  major 
difference  is  that  in  [8],  the  vector  equivalent  to  z  is  known  a 
priori.  Therefore,  there  is  no  need  to  obtain  estimates  of  zz^ 
as  in  Algorithm  3.  On  the  other  hand,  since  no  knowledge 
of  z  is  required  in  Algorithm  3,  the  same  algorithm  can  be 
employed  for  the  mismatch  situations  described  in  [8]. 

Vffl.  Conclusions 

In  this  paper,  we  have  proposed  a  linear  receiving  structure 
which  is  composed  of  the  conventional  matched  filter  followed 
by  a  tapped  delay  line.  The  structure  of  the  receiver  also 
incorporates  the  possible  use  of  an  antenna  array.  When 
the  weights  in  the  t^ped  delay  line  are  chosen  so  as  to 
maximize  the  signal-to-noise  ratio,  the  proposed  receiver  has  a 
unified  capability  of  rejecting  multiple-access  and  narrowband 
interferences,  and  combining  multipath  components.  We  have 
also  developed  nonadaptive  and  adaptive  algorithms  to  obtain 
the  optimal  weights  in  the  tapped  delay  line.  Among  the 
various  algorithms  proposed,  the  blind  adaptive  algorithm, 
which  is  computationally  simple  and  requires  the  same  amount 
of  information  as  the  conventional  matched  filter  receiver,  is 
suitable  for  practical  implementation.  Results  in  this  paper 
are  applicable  to  a  wide  range  of  both  coded  and  uncoded 
direct-sequence  spread-spectrum  multiple-access  systems. 

Appendix  I 

In  this  appendix,  we  prove  Proposition  1. 

We  need  the  following  lemmas  on  the  continuity  of  eigen¬ 
values  and  generalized  eigenvalues,  which  are  corollaries  of 
results  from  perturbation  theory  of  matrices  ([22,  Appendix 
K],  [18,  p.  412],  or  [19]). 


Lemma  1:  Let  be  a  sequence  of  Hermitian 

matrices  which  converges  to  the  Hermitian  matrix  A.  Let 
Amax(A)  and  Axnm(^)  denote  the  largest  and  the  smallest 
eigenvalue  of  A,  respectively.  Then 

lim  AmaxCAiv)  =  Amax  (4)  (52) 

iV— >oo 

lim  Amin(4jv)  =Aniin(4).  (53) 

N—^oo 

Lemma  2:  Let  {An}n^i  be  a  sequence  of  Hermitian 
matrices  which  converges  to  the  Hermitian  matrix  A,  and 
{Bn}n=i  ^  ^  sequence  of  positive  definite  Hermitian 
matrices  which  converges  to  the  positive  definite  Hermitian 
matrix  B.  Moreover,  let  XmaxiAN^B^)  be  the  largest 
generalized  eigenvalue  of  the  regular  matrix  pencil  (Aiv,  Bn), 
and  Amax(A,  B)  be  the  largest  generalized  eigenvalue  of  the 
regular  matrix  pencil  (A,B).  Then 

lim  Ainax(AiV,J5iv)  =  Aniax(A,B).  (54) 

iV— >oo 

Proof  (of  Proposition  1 ):  From  the  discussion  in  Section 
in,  we  know  that  limjv-.oo  ^  exists  and  it  is  positive  definite. 
Let  Rn,oo  denote  this  limit.  It  can  be  shown  easily  that 
Woo  =  Ri^\o^  1®  eigenvector  associated  with  the  largest 
generalized  eigenvalue  of  the  matrix  pencil  {zz^,Rn,oo) 
the  value  of  that  eigenvalue  is  z^R^^oo^.  Therefore,  from 
(22)  and  Lemma  2,  we  have 

lim  max  [SNR  (ttf)]  =  SNRoo  =  z^R^\oZ.  (55) 

N-^oo  w^O  ’ 

Note  that  wn  is  an  eigenvector  associated  with  the  largest 
eigenvalue  of  the  matrix  pencil  (Jl®  -  Ri,Ri)‘  Again  the 
existence  of  the  AWGN  component  guarantees  that  R„-  Ri 
and  Ri  satisfy  the  conditions  required  for  the  sequences 
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and  {Biv}w=i  in  Lemma  2,  respectively.  Applying 
Lemma  2  with  (24),  we  have 


N->oo  W^RvWn 


=  SNRoo. 


(56) 


Since 


lim  {Ri,  —  Rn)  =  0 

iV--+oo 


and 


lim  (Rv  —  Ri,  —  Rz)  =  0 

N'-^oo 


applying  Lemma  1,  we  can  show  that 

w^{Rv  -  Rv)wn  _  w^RzWn 
W^RiWff  W^Rn^N 


0. 


(57) 


Combining  (55H57),  we  have  the  first  inequality  in  Propo¬ 
sition  1. 

Next,  since 


lim  Rn  ^  =  •Rn.eo 

N-*oo  ^  ’ 


lim  viN  =  tOoo- 
JV— ^oo 


Since 

lim  RZ^  =  {zz^  +  Rn,oo)~^, 

iV—voo 

lim  =  (1  +  SNRoo) 

N—*-oo 

by  the  matrix  inversion  lemma.  Moreover,  since  wn  is  a 
complex  multiple  of  limjv-_^oo  is  just  a  complex 
multiple  of  limiv-^oo  Fi'om  (22),  we  have 

lim  ^  (58) 

N-*oo  w^RnViN 


r/>mhining  (58)  with  (55),  we  have  the  second  inequality  in 
Proposition  1.  The  other  two  inequalities  can  be  proved  in  a 
similar  way. 


APPENDIX  n 

Li  this  appendix,  we  prove  Propositions  2  and  3,  and  give 
stability  arguments  of  Algorithms  1  and  2. 

First,  we  need  the  following  lemma  on  the  eigenstnicture 
of  the  matrix  (/  —  Pz)Rv 

Lemma  3:  The  matrix  (/  -  Pz)Rv  has  the  diagonaUzation 

{I  -  Pz)Rv  =  Q^Q~^  (59) 

where  A  is  a  diagonal  matrix  whose  elements  are  the  eigenval¬ 
ues  {Am}mii  of  {I-Px)Rv  and  Q  is  an  invertible  matrix  with 
the  corresponding  eigenvectors  of  {I—Px)Rv  as  its  columns. 
All  the  eigenvalues  are  real  and  nonnegative,  and  exactly  one 
of  diem,  say,  Ai,  is  zero.  Then  the  first  column  of  Q  can  be 
chosen  to  be  wn,  die  MMSE  solution  wn  defined  in  (28). 


Proof:  Note  that  since  Rv  is  positive  definite,  Rv  =  GG, 
where  G,  which  is  invertible  and  Hermitian,  is  the  matrix 
square  root  of  Rv-  Suppose  A  is  an  eigenvalue  of  (^I  —  P z}Rv 
and  X  is  the  corresponding  eigenvector.  Then 

(/  -  Px)RvX  =  (/  -  Pz)GGx  =  Xx.  (60) 
Let  y  =  Gx,  then 

G{J  -  Pz)Gy  =  Xy.  (61) 

From  (60)  and  (61),  since  G  is  invertible,  the  eigenvalues 
of  G(I  -  Pz)G  and  (J  -  Px)Rv  are  the  same.  Moreover, 
if  a;  is  an  eigenvector  of  G(I  —  Pz)G,  then  G  is  an 
eigenvector  of  (J  -  Px)Ro-  Since  G{I  -  Pz)G  is  a  positive 
semidefinite  Hermitian  matrix,  all  its  eigenvalues,  and  hence, 
all  the  eigenvalues  of  (J  -  Pz)Rv  are  real  and  nonnegative. 
Moreover,  G{I  -  Px)G  has  the  diagonaUzation 

G{I-Pz)G  =  UKV^  (62) 

where  A  is  a  diagonal  matrix  whose  elements  are  the  eigen¬ 
values  {Am}m=i  of  G{I  -  Px)G  and  £7  is  a  unitary  matrix 
with  the  corresponding  eigenvectors  of  G{I  —  P x)G  as  its 
columns.  Let 

Q^yjz^IQhG-'^U. 

We  get  (59)  from  (62). 

Set  A  =  0  in  (60).  Since  Rv  is  invertible,  for  x  0,  (60) 
is  satisfied  when  and  only  when  Rv^  is  parallel  to  the  vector 
z.  Moreover,  since  Q  is  of  full  rank,  we  conclude  that  exactly 
one  of  the  eigenvalues  of  (J  -  Pz)Rv  is  zero.  By  rearranging 
the  columns  of  £7  if  necessary,  we  can  set  Ai  =  0  and  the 
first  column  of  Q  to  be  the  eigenvector  corresponding  to  Ai. 
Finally,  from  the  fact  that 

Q^RvQ=-{z^R:^z)I  (63) 

the  first  column  of  Q  is  exactly  vjn.  Cl 

Now  we  can  employ  Lemma  3  to  prove  Propositions  2 

and  3. 

Proof  (of  Proposition  2):  Let  u{j)  =  Q  We  can 

rewrite  (42)  as 

uC;)  =  (/-/xA)ttC;-l).  (64) 

From  (64),  the  mth  element  of  the  vector  u(j)  is  given  by 

UmU)  =  (65) 

where  Um(0)  is  the  mth  element  of  the  initial  vector  tt(0)  = 
Q~^w{0).  Since  Ai  =  0  and  by  (43),  |1  -  pXm\<f,  for 
m  =  we  have 

lim  w{j)  =  <3[.lim  =  ui(0)ti>Ai.  (66) 

J-+00  J— *>oo 

From  (63)  and  (65) 

MD 

trim)  =  53(1  -  pX„,f^Um{0)?.  (67) 

m=l 
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Therefore,  (67)  shows  that  a{w{j))  i  {z^ z)\ui{0)\‘^ . 
Finally,  the  proof  is  completed  by  showing  tii(O)  = 
z^w{0)/z^R^^z.  This  fact  can  be  shown  from  the  identity 


Proof  (of  Proposition  3):  Similar  to  (65),  we  have,  for 
n  =  1,2,  •  •  ■  ,MD 


lU)  =  C{j){l  -  pXnYuniO) 


For  each  j>0,  since  ||tfl(j)||  =  1,  ||Q«(i)|l  =  1-  Since  R„  is 
positive  definite,  so  is  the  matrix  Q^Q.  Denote  the  (m,  n)th 
element  of  Q^Q  by  Applying  these  to  (69),  we  get,  for 
n  = 


.(i)l  = 


_  ^  ( ^'(Q)  V/" ^n'(0)\ 


1-  pXm'Y  / 1  -Mn' Y 

1  -  pXn  )  \1-  PX„  ) 


j1 


if  (1  -^A„)«„(0)  ^  0.  Otherwise,  w„(i)  =  0.  Note  that  since 
2:®tt;(0)  ^  0  and  Ai  =  0,  (1  —  /xAi).wi(0)  5^  0.  If  (48)  is 
satisfied 


|1  —  Mil  >  |1  —  Mn|,  for  n  =  2, 3,  •  •  • , MD. 
Since  qn  =  ||tt)w-|p,  we  have  from  (70) 
lim  |ni(j)l  =  l/||tt>Ar|| 

J— ►OO 

and 

lim  Un{j)  =  0,  forn  =  2, 3,  —  - , MD, 

Combining  these  limiting  results  with  the  fact  that  w(J)  = 

Qu{j) 

|Wm(i)I  |«'JVm|/l|*Ar||. 

Since  WNm  #  0,  Wm(j)  #  0  for  all  sufficiently  laige  j.  Then, 
arg  (tWm(j))  =  0  for  all  sufficiently  large  j.  Hence,  we  have 

lim  arg(wi(j))  =  -axg{wNm)- 

J— +00 

This  implies  that 

lim  ui{j)  =  exp(-y  arg(t£;Ar,„))/||ttfArll 

J-.00 

and,  hence,  (49). 

We  notice  from  (47)  that  z^w{j)  =  C{j)z^ta{0)  for  all 
j  >  0.  Putting  this  and  (63)  in  (45),  we  get 

Therefore,  (71)  shows  that  a{w{j))  J,  l/z^R^^z.  □ 


The  numerical  instability  of  Algorithm  1  can  be  seen  from 
(65)  by  setting  m  =  1.  Since  1  —  fiXi  =  1,  any  perturbation 
on  Ai  caused  by  numerical  or  other  reasons  may  affect  the 
convergence  of  die  element  vi{j),  and,  hence,  the  convergence 
of  Algorithm  L  On  the  other  hand,  Algorithm  2  is  numerically 
stable  when  the  condition  in  (48)  is  satisfied.  It  can  be  seen 
from  (70)  that  the  convergence  of  Algorithm  2  does  not 
depend  on  the  absolute  magnitudes  of  the  factors  1  —  /xA^, 
for  n  =  1,2, but  on  their  relative  magnitudes. 
Therefore,  as  long  as  the  perturbation  on  A i  is  small,  the 
magnitude  of  1  -  /xAi  is  still  the  largest,  and  hence,  Algorithm 
2  converges  to  the  desired  result. 
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Abstract-  Arbitrarily  tight  upper  and  lower  bounds  on 
the  pairwise  error  probability  (PEP)  of  a  trellis-cod^  or 
convolutional-coded  direct-sequence  spread-spectrum  multiply 
access  (DS/SSMA)  communication  system  over  a  Rayleigh 
fading  channel  are  derived.  A  new  set  of  probability  density 
functions  (pdfs)  and  cumulative  distribution  functions  (cars) 
of  the  multiple-access  interference  (MAI)  statistic  is  denv^, 
and  a  modified  boundii^  technique  Is  proposed  to  obtain  tte 
bounds.  The  upper  bounds  and  lower  bounds  together  specify 
the  accuracy  of  the  resulting  estimation  of  the  PEP,  and  pve  an 
indication  of  the  system  error  performance.  Several  suboptimum 
decoding  schemes  are  proposed  and  their  performan^  are 
compared  to  that  of  the  optimum  decoding  scheme  by  the 
average  pairwise  error  probability  (APEP)  values.  The  approac 
can  be  used  to  accurately  study  the  multiple-access  capability  of 
the  coded  DS/SSMA  system  without  numerical  integrations. 

Index  Terms — Code  division  multiaccess,  error  correction  cod¬ 
ing,  error  analysis,  fading  channels. 


I.  Introduction 

PERFORMANCE  of  trellis-coded  modulation  (TCM)  sys¬ 
tems  over  the  Rayleigh  fading  channel  can  be  evaluated 
using  several  different  methods  [l]-[8],  [18].  In  [1],  a  residue 
method  was  used  to  compute  the  exact  pairwise  error  prob- 
ability  (PEP)  of  a  TCM  system  with  imperfect  channel  state 
information  (CSI).  In  [2],  tight  bounds  on  the  PEP  were  first 
obtained,  and  then  the  upper  bound  was  expressed  m  a  product 
form  to  be  used  with  the  transfer  function  bounding  technique 
[17],  [18]  to  get  the  upper  union  bound  (UUB)  on  the  bit¬ 
error  probability  (BEP)  with  ideal  CSI.  In  [4]  and  [6H8], 
the  saddle-point  approximation  (SAP)  method  was  used  to 
approximate  the  PEP.  In  [3]  and  [5],  Chemoff  upper  bounds  on 
the  PEP  were  derived  for  both  the  ideal  CSI  and  no  CSI  cases. 
The  Chemoff  upper  bounds  then  were  used  with  the  transfer 
function  bounding  technique  to  get  the  UUB  on  the  BEP. 

In  general,  Chemoff  upper  bounds  on  the  PEP’s  allow  the 
use  of  the  transfer  function  bounding  technique  to  get  the  UUB 
on  the  BEP  over  the  fading  channel,  but  the  result  is  too  loose. 
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On  the  other  hand,  it  is  impractical  to  compute  the  exact  PEP’s 
(if  obtainable)  for  all  the  error  paths,  and  only  dominant  PEP 
values  can  be  used  to  approximate  the  UUB  on  the  BEP.  The 
situation  becomes  more  complex  when  the  multiple-access 
channel  is  considered.  Arbitrarily  tight  performance  bounds 
of  asynchronous  direct-sequence  spread-spectrum  multiple- 
access  (DS/SSMA)  systems  have  been  computed  for  both  the 
uncoded  [10HI2]  and  coded  [14]  systems  over  the  additive 
white  Gaussian  noise  (AWGN)  channel  when  determinis¬ 
tic  spreading  sequences  with  arbitrary  chip  waveforms  are 
used.  In  this  work,  the  probability  density  function  (pdf) 
and  cumulative  distribution  function  (cdf)  of  the  multiple- 
access  interference  (MAI)  statistic  from  each  interfering  user 
were  first  computed,  and  then  the  (one-dimensional)  bounding 
technique  [11]  was  applied  to  get  the  results.  In  this  paper, 
we  find  that  the  MAI  statistic  needed  for  the  computation 
of  the  performance  bounds  of  the  coded  DS/SSMA  systems 
over  Rayleigh  fading  channels  is  quite  different  from  that  over 
the  AWGN  channel;  in  particular,  a  multidimensional  pdf  is 
needed  to  characterize  the  MAI  statistic,  and  the  support  in 
each  dimension  of  the  pdf  is  nonnegative.  Therefore,  a  new 
set  of  pdfs  and  cdfs  of  the  MAI  statistic  is  derived,  and  a 
modified  bounding  technique  is  proposed,  together  with  the 
residue  method  [1],  to  obtain  the  arbitrarily  tight  upper  and 
lower  bounds  on  the  PEP  of  the  coded  DS/SSMA  systems. 

It  was  shown  in  [7]  and  [8]  that  the  SAP  method  can  be 
used  to  approximate  the  PEP  for  narrowband  coded  systems 
over  a  general  class  of  fading  channels,  and  can  avoid  some 
numerical  problems.  However,  in  this  paper,  we  focus  on 
the  exact  PEP  only.  The  exact  PEP  can  serve  as  a  reliable 
reference  to  other  possible  upper  bounds  on  the  PEP,  an 
is  an  indication  of  the  system  error  performance  by  itselt. 
Hence,  the  SAP  method  is  not  studied  here.  Also,  the  upper 
union  bound  on  the  BEP  can  be  approximated  by  using 
the  dominant  PEP  values,  but  this  issue  is  not  discusse 
here.  Several  suboptimum  decoding  schemes  are  proposed  ana 
their  performances  are  compared  to  that  of  the  maximum- 
likelihood  optimum  decoding  scheme  by  the  average^pairwise 
error  probability  (APEP)  values.  We  find  that  the 
of  each  suboptimum  decoding  scheme  is  very  close  o 
of  the  optimum  decoding  scheme  under  different  condiUon^ 
The  formulas  derived  in  this  paper  are  suitable  tor 
systems  over  a  slow  Rayleigh  fading  channel  with  one-  or  two- 
dimensional  signal  sets,  imperfect  CSI. 

ideal  interleaving,  and  for  asynchronous  DS/SSMA  systems 
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Fig.  2.  Transmitter/receiver  pair  of  the  first  user. 


with  arbitrary  chip  waveforms.  We  call  the  system  under 
study  the  TCM/SSMA  system.  The  techniques  described  in 
this  paper  can  also  be  used  to  analyze  convolutional-coded 
SSMA  (CC/SSMA)  systems. 

The  paper  is  organized  as  follows.  In  Section  II  we  present 
the  system  model.  In  Section  III  we  characterize  the  MAI 
statistic  and  propose  a  modified  bounding  technique.  Section 
IV  uses  the  residue  method  and  the  modified  bounding  tech¬ 
nique  to  derive  the  upper  and  lower  bounds  on  the  PEP  for 
both  the  optimum  and  suboptimum  decoding  schemes.  Finally, 
Section  V  offers  some  conclusions. 

n.  System  Model 

The  system  model  (see  Fig.  1)  that  we  examine  is  sim¬ 
ilar  to  that  used  frequently  in  previous  studies  of  asyn¬ 
chronous  TCM/SSMA  systems  [13],  [14].  There  are  K  trans¬ 
mitter/receiver  pairs  (see  Fig.  2)  sharing  a  common  channel. 

A.  Transmitter 

The  input  data  sequence  is  divided  into  blocks  of  n  bits. 
Next,  nl  of  these  bits  select  a  coset  of  the  signal  constellation 
through  a  rate  nl/(nl  +  1)  convolutional  encoder,  while  the 
remaining  n2  select  a  signal  point  within  the  given  coset.  This 
is  the  mapping  by  set  partitioning  [16].  We  define  the  fcth 
user’s  baseband  coded  signal  by 


where  is  the  output  symbol  sequence  from  the  trellis 

encoder  of  the  fcth  transmitter,  T  is  the  symbol  interval,  and 
PtH)  is  the  unit  pulse  on  [0,  T).  We  normalize  the  signal 
constellation  so  that  the  average  signal  energy  equals  1.  All 
signature  sequences  are  deterministic.  The  signature  sequence 
where  each  S  {-1,  1},  is  used  to  form  the  fcth 
user’s  spreading  waveform  that  is  given  by 

CXD 

aW(t)=  ^  a\^U{t-hTc)  (2) 

h=-~oo 

where  ipit)  is  the  common  chip  waveform  for  all  signals.  The 
chip  waveform  t*  time-limited  to  [0,  Tc),  where  the  chip 
duration  Tc  satisfies  Tc  =  T/N,  and  N  is  the  period  of  the 
signature  sequence.  The  chip  waveform  is  also  normalized 

such  that  Jq  °  ~  Ne¬ 

llie  baseband  transmitted  signal  of  the  fcth  user  is  given  by 

exp(i</>fc)  (3) 

where  P  is  the  power  of  the  transmitted  signal,  and  tpk  is  the 
phase  of  the  fcth  transmitter. 

B.  Receiver 

The  received  baseband  signal  for  all  receivers  is 
r{t)  = 

^  -  Tk)b^'°\t  -  Tk)  exp  (jOk)  +  n{t) 


where  is  a  sequence  of  independent,  zero-mean, 

complex,  Gaussian  fading  processes  with  a  common  variance 
of  a^  =  l/2  parameter  that 

accounts  for  propagation  delay  and  the  lack  of  synchronism 
between  the  transmitters,  0k  =  4>k  —  ^cTk  is  the  phase  angle 
of  the  fcth  carrier,  Uc  is  the  carrier  frequency,  and  n{i)  is  the 
complex  envelope  of  the  channel  white  Gaussian  noise  with 
double-sided  power  spectral  density  (psd)  No-  The  notation 
£;{•}  denotes  expectation.  We  assume  that  5^*'^*)  =  9j  _  over 
the  jth  interval  (i.e.,  slow  fading),  and  ideal  interleaving  in  this 
study.  Then  are  independent  and  identically  distributed 
(i.i.d.)  complex  Gaussian  random  variables,  each  with  zero 
mean  and  variance  Symmetry  allows  us  to  consider  only 
the  receiver  that  is  listening  to  the  first  transmitter  (the  “desired 
receiver’’).  Accordingly,  ti  =  6i  =  0,  and  for  fc  1,  the 
delays  Tk  and  the  phase  angles  0k  are  random  variables  that 
are  uniformly  distributed  on  [0,  T)  and  [0,  27r),  respectively. 
The  average  received  symbol  energy  and  bit  energy  are  given 
by  Es  =  2alPT  and  Eb  =  l/nEs,  respectively. 

Correlation  receivers  are  used  to  demodulate  the  received 
signal  (see  Fig.  2).  The  decision  statistic  of  the  first  user  at 
interval  j  is  given  by 

+  +  (5) 

k=2 


6('=)(0=  f;  bfvTit-jT) 

j--oo 


where 


(1) 


=  ^2PT/Nogf\ 


fc  =  1,  •  •  •  ,  jff 
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is  a  zero-mean  complex  Gaussian  random  variable  with 
ance  al  =  E,/No,  nj  is  a  zero-mean  complex  Gaussian 
random  variable  with  variance  cr^  =  1,  and  Ij  is  the  MAI 
from  the  fcth  user  given  by 

/j*>  =  i  [bfliRk,  i(Tfc)  +  i(rfc)]  exp  {jOk)  (6) 

where  Rk,i{-)  and  Rk,i{-)  are  the  continuous-time  partial 
crosscorreiation  functions  of  the  fcth  and  first  spreading  wave¬ 


and 

RkM'nc)  =  Cfc,i(Afc)(rc  -  Sk)  +  CkA^k  +  (11) 

where  Xk  is  uniformly  distributed  on  the  set  {0,  •  •  • ,  )V  -  1}, 
sk  =  Tk- XkTc  is  uniformly  distributed  on  [0,  Tc),  and  Cfc,  i  (•) 
is  the  aperiodic  crosscorrelation  function  defined  in  [9].  Note 
that  the  random  variables  Xk  and  Sk  are  independent.  By 
replacing  Rk,i{-)  and  Rk,i{-)  by  (10)  and  (11)  in  (6),  we 

can  express  as 


forms  defined  in  [9]. 

The  channel  state  information  (CSI)  is  available  for  the 
desired  receiver,  and  is  denoted  by  the  sequence  t;  =  {I’y}. 
where  vj  is  the  estimate  of  \  We  assume  that  Vj’s  are 
i.i.d.  complex  Gaussian  random  variables  wi*  zero  mean  Md 
variance  of  and  the  covariance  of  Uj  and  Vj  is 
The  correlation  coefficient  of  and  Vj  is  then  given  by 
„  _  (r2^/<T„(T„.  We  assume  that  the  desired  receiver  has 
perfect  knowledge  of  u.  If  we  condition  on  vj,  the  random 
variable  is  then  a  complex  Gaussian  random  variable  with 

mean  u{orJ<T^)vj  =  pVj  and  variance  o2(l  -  |uP)  =  "pe 
knowledge  of  the  MAI  statistic  for  the  desired  receiver  (dashed 
line  in  Fig.  2)  will  be  discussed  in  Section  IV.  We  assume 
either  we  have  perfect  knowledge  of  it  (optimum  decoding), 
or  no  knowledge  of  it  (suboptimum  decoding). 


m.  Characterization  of  the 
MAI  Statistic 

In  this  section,  we  characterize  the  MAI  statistic  in  a 
TCM/SSMA  system  with  the  Rayleigh  fading  channel.  For 
simplicity,  we  study  the  rectangular  chip  waveform  only. 
However,  we  can  obtain  the  results  for  the  affiitrary  chip 
waveform  case  by  applying  the  techniques  used  in  [10]. 

Without  loss  of  generality,  we  assume  that  an  error  symbol 
sequence  diverges  from  the  transmitted  symbol  sequence  firom 
the  first  demodulation  interval,  and  remerges  with  the  trans¬ 
mitted  sequence  after  L  intervals.  We  will  see  that  the  error 
performance  of  the  system  under  study  depends  on  the  MAI 
through  the  random  vector  I  given  by 

where 

k=2  fc=2 

for  j  =  1  •  •  •  L,  and  is  as  defined  in  (6).  Since  the 
MAI  from  different  interfering  users  are  independent,  I  can 
be  obtained  by  the  convolution  of  the  random  vectors  I 
defined  as 

=  (9) 

for  A:  =  2  •  K.  It  is  shown  in  [9]  that  for  0  <  XkTc  <Tk  < 

(Afc  -1-  l)2c 

Rk  i{rk)  =  Ck,  i{Xk  -  N)iTc  -  Sk)  +  Ck,  i(Afc  -N  +  l)sk 

(10) 


=  -  Uk)?+[B[yk+Bi^]{^  -  Uk)? 

'  '  '  (12) 

where  Uk  =  Sk/Tc  is  uniformly  distributed  on  [0,  1),  and 

aW  =  1  Ck,  i{Xk-N  +  l)  +  bf]  Ck,iiXk  +  1)] 

=  1  [bQj-i  Ck, i(Afc  -N  +  l)  +  b^Q^j  Ck,i{Xk  +  1)] 
Bi'^]  =  ^ Ck,i{>^k  -N)  +  Ck, i(Afc)]  (13) 

with  bf]  =  Re  {6f  }  and  =  Im  {bf  }. 

A.  Modified  Bounding  Technique 

In  order  to  evaluate  the  multiple-access  capability,  we  mod¬ 
ify  the  bounding  technique  introduced  in  [11]  as  follows.  We 
define  the  discrete  random  vector  Pl**)  =  [Pi  S  •  •  •  >  -Di,  1» 
which  characterizes  /(‘'I,  as 

Df**)  =  [ii,  •  •  • ,  fx,],  if  €  V(n,  •••,*£,)  (14) 

where  V(h,  •  •  • ,  it)  is  a  cube  defined  as 

V(ii,  •••jit) 


Nu  is  a  positive  integer,  and  /(*)  is  ^  defined  in  (9).  The 
probability  mass  function  (pmf)  of  P(  )  is  defined  as 

d('=)(ii,  •  •  • ,  it)  =  Pr  =  Hi.  •  •  •  - 


It  is  examined  in  the  next  subsection. 

We  define  the  discrete  random  vector  D,  which  charactenzes 
I,  as  D  =  The  pmf  of  D  is  defined  as 


d(ii,---,it)  =  Pr(P  =  Hi.---.*i^])- 

:ince  the  random  vectors  in  the  set  {P(*);  A;  =  2,  ^ 
re  independent,  d{;  -  •  • ,  •)  can  be  computed  by 
1  discrete  convolution  of  the  discrete  densities  o  (•,  >  )• 

jct  the  cube  VK{h,  •  •  • ,  it)  be  defined  as 


VR:(ii,---,it)  =  |(2:i. 


i,  ii+K-1 


Nu 

i  =  i. 


•’4 


(18) 
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Then  the  probability  mass  d{ii,  iz,)  means  part  of  the 
probability  that  I  falls  in  the  cube  VK{ii^  •  •  * ,  il)*  This  will 
be  used  to  compute  the  upper  and  lower  bounds  on  the  PEP 
in  Section  IV. 


B. 

From  (8),  (9),  (12),  and  (13),  if  we  condition  on  the  sequence 
of  random  variables  and  Afc,  the  random  vector 

is  a  quadratic  function  of  the  single  random  variable  f/fc.  The 
conditional  pdf  and  cdf  of  are  given  in  the  Appendix.  For 
L  =  1,  the  conditional  pmf  of  is  given  by 

where  (•)  is  the  cdf  of  that  is  given  in  the  Appendix. 
For  L  >  1,  the  conditional  pmf  of  is  given  by 


f(fc) 


< 


ij  +  1 


We  do  not  have  a  closed-form  formula  for  (20).  However,  this 
expression  can  be  obtained  by  solving  a  set  of  L  equations.  To 
obtain  the  (unconditional)  pmf  of  we  have  to  average 
(19)  or  (20)  over 


C.  Expected  Value  of  Ij 
From  (12),  if  we  condition  on  the  random  variables 


(fc) 


Aj*^-,  Bi’],  and  conditional  expected  value  ^j 

is  given  by  1/3  [(A^*^.)^  -I-  A^^J-  A^*^-  -I-  -b 

+  {B^^)^].  If  the  average  value  of  the  symbols  in 
the  signal  set  is  zero,  then  by  averaging  over  the  above  random 
variables,  the  expected  value  of  Ij  can  be  shown  to  be 

)  =  ^I2«»:,i(0)  +  «fc,i(l)]  =  Mf  (21) 


of 


where 


iV-l 


(22) 


l=l-N 


and  Cfe,  i(-)  is  the  aperiodic  crosscorrelation  function  defined 
in  [9].  The  expected  value  of  Ij  is  then  given  by 


E{Ij)  =  Y,  =  Mi-  (23) 

fc=2 

The  value  M/  will  be  used  to  form  a  suboptimum  decoding 
metric  in  Section  IV. 


IV.  Performance  Analysis 

In  this  section,  the  upper  and  lower  bounds  on  the  PEP 
are  derived  for  the  optimum  and  several  suboptimum  de¬ 
coding  schemes.  Let  the  transmitted  symbol  sequence  be 
6  =  }^_i  and  an  error  symbol  sequence  be  6  =  (bj 

where  L  is  the  length  of  the  period  in  which  6  diverges  from  h. 


The  PEP  means  the  probability  that  the_  decoder  chooses  b 
instead  of  b,  and  is  denoted  by  Pr  (b  ->  b).  In  order  to  fairly 
compare  the  performances  of  different  decoding  schemes,  we 
define  the  average  pairwise  error  probability  (APEP)  as 

Pr(b^b)=lPr(b-^b)+iPr(b^b).  (24) 

Note,  that  the  APEP  is  the  Bayes  error,  and  the  optimum 
decoding  scheme  is  the  scheme  that  minimizes  the  APEP,  not 
the  PEP.’  However,  it  suffices  to  give  the  derivation  of  the 
upper  and  lower  bounds  on  the  PEP  only. 


A.  Optimum  Decoding  Scheme  (SO  Scheme) 

If  we  have  perfect  knowledge  of  the  MAI  statistic  I,  then  by 
conditioning  on  the  symbol  sequence  and  the  CSI  for  u^^^’s, 
the  decision  statistics  Tj’s  are  independent  complex  Gaussian 
random  variables  and  the  maximum-likelihood  optimum  de¬ 
coding  metrics  of  Ae  correct  symbol  sequence  b  and  the  error 
symbol  sequence  6  are  given  by 


M(6)  =  Y 

i=i 


Tj  -  PVjbf'> 

1 

_  _L  In  f 

2o| 

(25) 


and 


M{b)  =  Y 


rj  -  /3vjbf^ 

2 

-  4-  In  (Tf^A 

2a^j 

-t- 

(26) 


where 


and 


=  bf'>  ^ol  +  l  +  ollj  =  o%  j 


-I- 1  -f  (j\lj 


(27) 


for  j  =  1  •  •  •  2/.  The  parameters  /9,  a^,  and  are  defined  in 
Section  II-B,  and  Ij  is  as  defined  in  (8).  In  practice,  the  perfect 
knowledge  of  I  is  not  available,  and  the  variances  <x^  j  and 
Wq  j  have  to  be  replaced  with  some  approximations.  However, 
the' performance  of  the  optimum  decoding  scheme  serves  as  a 
benchmark  for  other  suboptimum  decoding  schemes. 


B.  Suboptimum  Decoding  Schemes 

In  this  subsection,  we  assume  that  we  have  no  knowledge  of 
the  MAI  statistic  I,  and  propose  three  suboptimum  decoding 
schemes.  We  use  the  same  forms  of  decoding  metrics  M(6)  and 
M(6)  as  defined  in  (25)  and  (26)  and  compute  the  variance 
terms  and  w'j  by  three  kinds  of  approximations,  instead 
of  the  optimum  values  Oq  j  and  ^oj-  Note  that  the  three 
decoding  metrics  introduced  here  are  used  for  either  the 
simplicity  or  the  resulting  fairly  good  performances,  and  the 
same  approach  can  be  used  to  get  the  bounds  on  the  PEP  when 
other  possible  decoding  metrics  are  used.  The  approximation 
values  and  notations  are  given  as  follows. 
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First  Suboptimum  Decoding  Scheme  (SI  Scheme):  The  first 
suboptimum  decoding  metric  is  the  optimum  decoding  metric 
for  the  case  of  no  MAI.  In  this  case,  we  have 


n-2  ^ 

aj  bj 


a?  +  l 


CT-  +  1 


for  the  jth  decoding  interval.  The  SI  scheme  is  useful  when 
the  MAI  is  small  or  when  some  MAI  cancellation  scheme  is 
employed  in  the  receiver. 

Second  Suboptimum  Decoding  Scheme  (S2  Scheme):  The 
second  suboptimum  decoding  metric  uses  a  constant  value, 
say  1,  as  the  variance  term.  In  this  case,  we  have 

=  =  l  (29) 

for  each  interval  j.  The  S2  scheme  has  the  advantage  of 
simplicity,  and  is  useful  when  the  MAI  is  large. 

Third  Suboptimum  Decoding  Scheme  (S3  Scheme):  The  third 
suboptimum  decoding  metric  uses  the  expected  value  of  the 
MAI  statistic  Ij  in  the  approximation  of  the  variance  term.  In 
this  case,  we  have 


The  notation  *  above  denotes  complex  conjugate.  Given 
and  Ij,  Tj,  and  Vj  are  zero-mean  complex  Gaussian  random 
variables  with  variances  of  +  Ij)<^u  +  ^ 

respectively,  and  a  covariance  of  6^  Hence,  (32)  is  a 
quadratic  form  of  complex  Gaussian  random  variables.  Let 
$j  (s)  be  the  two-sided  Laplace  transform  of  the  pdf  of  the 
random  variable  Vj.  Then  with  a  slight  modification  to  [19] 
or  [20],  $j(s)  is  given  by 

$,(s)  =  7 - - r  (37) 

^  {s  -  Plj){s  -  P2j) 

where 

[P.,,  ]  ^  h  -  (38) 

\V2, i  J  -  Wj  +  -P\,  jP2,  j 

are  the  left  plane  (LP)  and  right  plane  (RP)  poles,  respectively, 
with 


aj  =  \by\crl  +  l  +  alMi  (30) 

for  the  jth  interval,  where  Mi  is  as  defined  in  (23).  The 
performance  of  the  S3  scheme  will  be  shown  to  be  very 
close  to  that  of  the  optimum  decoding  scheme  for  all  MAI 
conditions.  The  reason  for  this  is  that  the  pdf  of  the  MAI 
is  highly  centered  around  its  expected  value.  In  order  to 
use  this  scheme,  the  knowledge  of  the  number  of  users  K 
is  assumed. 

C.  PEP  of  Coded  System 

In  order  to  investigate  the  system  performance  for  both  the 
optimum  and  suboptimum  schemes,  we  use  the  general  metrics 
(25)  and  (26)  in  the  following  derivation.  The  decoder  will 
choose  b  instead  of  h  if  M(5)  <  M(6),  or 

V  =  ^Vj<8  (31) 


Vj  =  Aj\Tj\^  +  Bj\vj\^  +  CjTjV*  +  qr*Vj  (32) 


2a]  2a] 


PljP2,j  = 


2\u\‘^<£]alalj 

<T?[(l-H2)(|6fp-|4'^|2)  +  HX- 

2\u\^d]alj  ~ 


d]  =  \bf'> -bf^\  .  (41) 

Since  the  random  variables  Vfs  are  independent  (we  as- 
sume  ideal  interleaving  in  this  study),  the  two-sided  Laplace 
transform  of  P  =  Ej  is  given  by 

^p(s) = n 

i6»7 

where  the  product  is  taken  over  the  set  t]  of  j  for  which 
b^}^  b^^^  and  the  region  of  convergence  is  the  vertical  strip 

enclosing  the  jm-axis  bounded  by  the  closest  poles  on  either 
side.  If  we  condition  on  the  MAI  statistic  I,  the  conditional 
PEP  is  a  Bromwich  contour  integral  [21],  and  can  be  computed 
by  the  residue  method  as 


Pr(6->fc|/)=  / 
Jp 

_  f  -  ^  Residi 


[e''*$p(s)/s]ds 


_  /  -  X)  Residue  [e**$p(s)/s]RPp„,.. ,  if  5  <  0 

“  1  53  H®***!**®  [e**^p(s)/«lLPp,i..  U  {0}>  If  ^ 

^  (43) 
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where  p  is  some  constant  satisfying  0  <  p  <  first  RP  pole  of 
^p(s)  and  6  is  as  defined  in  (36). 

In  order  to  examine  the  effect  of  MAI  on  the  system  error 
performance,  we  rewrite  (39)  as 


r(l) 


Es 


PIJP2J  =  *“■ 


Nq  +  PjEi 


E. 


(44) 


No  +  IjEs 


for  the  optimum  decoding  scheme.  If  we  fix  E^,  we  see  that 
an  increase  in  Ij  can  be  regarded  as  an  increase  in  thermal 
noise  psd  No,  and,  hence,  can  increase  the  PEP.  By  using 
(18),  the  upper  bound  (UB)  and  lower  bound  (LB)  on  the 
PEP,  Pr(6  b),  are  given  by 


UB  =  ^  Pr{6  — ►  6|  J  =  * '  *  j  ^l)}  d{ii,  •  •  * ,  ^l) 

tl,***.*L 

(45) 


and 


LB  =  ^2  -^611  =  *•* »  u)} d{ii,  **•,  u) 

(46) 


where 

and 


ii-^K  -1 
Nu 


+  K  —  1 
Nu 


(47) 


Dhiiu  *'•)  = 


(48) 


The  bounds  become  tighter  as  Nu  increases.  The  UB  and  LB 
on  the  APEP  can  then  be  obtained  via  formulas  (24),  (45),  and 
(46).  From  (44),  we  also  see  that,  since  {Eg/No  +  IjEg) 
1/Ij  as  Eg /No  oo,  it  is  the  MAI  that  limits  the  system 
performance. 


Fig.  3.  APEP’s  of  the  rate  2/3  TCM/SSMA  system  with  an  8-PSK  signal 
set  versus  the  UUB  on  the  BEP  of  the  uncoded  system  with  a  4-PSK  signal 
set.  Various  decoding  schemes  are  considered,  and  both  the  UB  (formula  (45)) 
and  LB  (formula  (46))  are  plotted  with  Nu  =  4000.  The  SNR,  EbfNo,  is 
fixed  to  20  dB. 


Fig.  4.  APEP’s  of  various  decoding  schemes  in  an  uncoded  16-QAM  system. 
Both  the  UB  (formula  (45))  and  LB  (formula  (46))  on  the  APEP  are  plotted 
with  Nu  =  4000.  The  SNR,  Eb/No,  is  fixed  to  20  dB. 


D.  Numerical  Examples 

In  this  subsection,  numerical  examples  are  given  to  illus¬ 
trate  the  tightness  of  the  performance  bounds  derived  in  this 
paper.  We  also  compare  the  performances  of  the  suboptimum 
decoding  schemes  (Section  IV-B)  with  that  of  the  optimum 
decoding  scheme  (Section  IV-A)  and  suggest  the  appropriate 
suboptimum  decoding  schemes  for  various  CSI,  MAI,  and 
signal  set  conditions.  Both  the  UB  and  LB  are  given  for 
the  APEP.  The  SNR,  E^/No  =  {l/n)Eg/No,  equals  20  dB, 
and  the  rectangular  chip  waveform  is  used  throughout  the 
examples.  The  parameter  N^,  which  controls  the  tightness  of 
the  bounds,  equals  4000  in  Figs.  3  and  4,  and  equals  1000 
in  Fig,  5. 

In  Fig.  3,  we  compare  one  dominant  APEP  of  a  TCM/SSMA 
system  with  the  UUB  on  the  BEP  of  an  uncoded  system  for 
the  four  decoding  schemes  SO,  SI,  S2,  and  S3.  Gold  sequences 
with  length  N  =  127  [15]  are  used.  For  the  TCM/SSMA 
system,  an  8-PSK  signal  set,  perfect  CSI  («  =  1),  and  a  rate 
2/3,  8-state  convolutional  encoder  [16]  are  used  in  the  TCM 


part.  The  correct  and  incorrect  symbol  sequences  are  chosen 
to  be  6  =  {1,  1}  and  6  =  {-1,  i},  respectively.  Hence,  we 
have  L  =  2  and  r)  =  {1,  2}.  For  the  uncoded  system,  a  4- 
PSK  signal  set  is  used  and  perfect  CSJ  is  assumed.  Since  the 
equal-energy  signal  sets  (MFSK)  are  used,  the  four  decoding 
schemes  are  equivalent  for  the  ur?coded  system,  and  the  three 
suboptimum  decoding  schemes  are  equivalent  for  the  coded 
system.  Hence,  only  one  pair  of  UB  and  LB  is  given  for  the 
UUB  on  the  BEP  of  the  uncoded  system,  and  two  pairs  of  UB 
and  LB  are  given  for  the  APEP’s  of  the  coded  system.  It  is 
observed  that  the  performances  of  the  suboptimum  decoding 
schemes  and  the  optimum  decoding  scheme  are  very  close  for 
all  the  MAI  conditions,  which  are  specified  by  K, 

In  Fig.  4,  we  compare  the  APEP’s  of  various  decoding 
schemes  in  an  uncoded  16-QAM  system  for  the  error  event 
with  b  =  {l/\/l0(3  +  z3)}  and  &  —  {l/yi0(~l  —  t)}.  Gold 
sequences  with  length  N  =  127  [15]  are  used.  Also,  the 
imperfect  CSI  with  u  =  0.99  is  considered.  Since  the  two 
symbols  in  the  error  event  do  not  have  the  same  energy  and 
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Fig.  5.  APEP’s  of  various  decoding  schemes  in  the  rate  3/4  TCM/SSMA 
system  with  a  16-QAM  signal  set.  Both  the  UB  (fonnula  (45))  and  LB 
(formula  (46))  on  the  APEP  are  plotted  with  Nu  =  1000.  The  SNR.  Eb/No, 
is  fixed  to  20  dB. 

|w|  ^  1,  the  four  decoding  schemes  are  not  equivalent.  It  is 
observed  that  the  performance  of  the  S3  scheme  is  very  close 
to  the  SO  scheme  for  all  values  of  K.  As  expected  from  the 
formulations  in  Section  IV-B,  the  SI  scheme  is  appropriate  for 
use  when  the  number  of  users  K  is  small,  and  the  S2  scheme 
is  appropriate  for  use  when  the  number  of  users  K  is  large. 

In  Fig.  5.  we  compare  one  APEP  of  various  decoding 
schemes  in  a  TCM/SSMA  system.  A  16-QAM  signal  set  and 
a  rate  3/4,  8-state  convolutional  encoder  [16]  are  used  in  the 
TCM  part.  Gold  sequences  with  length  iV  =  31  [15]  are  used. 
Also,  the  imperfect  CSI  with  u  =  0.8  is  considered.  Both  the 
UB  and  LB  are  given  for  the  APEP,  where  the  correct  and 
incorrect  symbol  sequences  are  chosen  to  be 

b  =  {l/v^(-l  -  i),  l/v/l0(3  -f  r3)} 
and 

b  =  {1/vT0(3  -  i),  l/v^(l  -  *3)}. 

Hence,  we  have  L  =  2  and  r?  =  {1,  2}.  We  see  again  that  the 
S3  scheme  is  appropriate  for  use  for  all  the  MAI  conditions. 
The  performance  of  this  system  is  not  very  good  since  both 
N  and  |u|  are  not  large  in  this  example. 

V.  Conclusion 

In  this  paper,  we  have  derived  a  new  set  of  pdfs  and 
cdfs  of  the  MAI  statistic  and  have  proposed  a  modified 
bounding  technique  to  obtain  tight  upper  and  lower  bounds 
on  the  PEP  of  the  coded  DS/SSMA  system  over  the  slow 
Rayleigh  fading  channel  with  imperfect  CSI.  The  approach 
yields  bounds  that  can  be  made  arbitrarily  tight  and  avoids 
numerical  integrations.  The  tight  bounds  can  be  used  to 
accurately  study  the  multiple-access  capability  of  the  coded 
DS/SSMA  systems  and  to  compare  the  performances  of  the 
suboptimum  decoding  schemes  with  that  of  the  optimum 
decoding  scheme. 

The  comparison  among  the  four  decoding  schemes  are 
summarized  as  follows.  For  the  uncoded  system,  or  error 
event  with  length  L  =  1  in  the  coded  system,  the  four 


decoding  schemes  are  equivalent  if  an  equal-energy  signal  set 
is  used,  or  if  perfect  CSI  is  assumed.  For  the  error  event  with 
length  L  >  1  in  the  coded  system,  the  three  suboptimum 
decoding  schemes  are  equivalent  if  an  equal-energy  signal  set 
is  used,  or  if  perfect  CSI  is  assumed.  The  performances  of  the 
suboptimum  and  optimum  schemes  are  shown  to  be  very  close. 
Finally,  if  a  noncqual-encrgy  signal  set  is  used  and  imperfect 
CSI  is  assumed,  the  SI  scheme  is  appropriate  for  use  when 
the  MAI  is  small,  the  S2  scheme  is  appropriate  for  use  when 
the  MAI  is  large,  and  the  S3  scheme  is  appropriate  for  use 
for  all  MAI  conditions.  Hence,  the  S3  scheme  is  suggested  for 
use  when  knowledge  of  the  number  of  users  K  is  available. 


VI.  Appendix 

pdf  AND  cdf  OF  -t-  BU  +  C 

Suppose  that  X  =  AU^  -I-  BU  C,  where  A,  B,  and  C  are 
constants  such  that  A  =  B^  —  AAC  <  0,  A  >  0,  C  >  0,  and 
U  is  uniformly  distributed  on  [0,  1).  Let  /r  and  i/  be  defined  as 
H  =  -J5/2A  and  r-  =  -A/4A.  The  case  when  0  <  m  <  1/2 
is  illustrated  in  Fig.  6.  The  random  variable  X  then  has  the 
probability  density  function  fx{x)  given  by  the  following: 

1)  If  Ai  <  0,  then 


y/AAx  +  A  ’ 

2)  If  ft  =  0,  then 

"  2^A{x  -  C)  ’ 

3)  If  0  <  M  <  1/2.  then 
2 


fx{x)  =  < 


y/AAx  +  A  ’ 

1 

I,  \/AAx  -t-  A  ’ 


foiC  <x<A  +  B  +  C. 

(49) 

for  C  <  X  <  A  -1-  C.  (50) 

for  <  X  <  C 

for  C  <  X  <  A  +  B  +  C*. 

(51) 


4)  If  At  =  1/2,  then 


fxix)  =  { 


\jAAx  -1-  A  ’ 
1 

I  -B' 


for  1/  <  X  <  C 

for  X  =  C. 


(52) 
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5)  If  1/2  <  /i  <  1,  then 
2 


/a'W  =  { 


y/AAx  +  A 
1 

I  y/4Ax  +  A 


,  for  1/  <  X  <  A-\-  D  +  C 
,  for/l  +  B  +  C<a;<C. 


(53) 


6)  If  /X  >  1,  then 


/x(^)  = 


1 


, _ ~,forAAB-fC<x<C.  (54) 

V4Ax  4-  A 

The  distribution  function  of  X,  Fx(a:),  is  given  by  the 
following: 

1)  If  <  0,  then 

Fxix)  =  ^(V4j4x  +  A  -  jSV  forC  <  X  <  A+B+C. 
2A  \ 

2)  If  /X  =  0,  then 

Fx(x)  =  forC  <x<A  +  a 

^  ^  (56) 

3)  If  0  <  /X  <  1/2,  then 

Fx{x)  = 

4v'4i4x  +  A,  for  1/  <  X  <  C 

—  ^•\/4Ax  +  A  -  -S) )  for  C  <  X  <  A  +  B  +  C. 
^  (57) 


4)  If  /X  =  1/2,  then 

Fjv(a;)  =  •lV4Ax  +  A,  for  ix  <  x  <  C.  (58) 

Jx 

5)  If  1/2  <  /X  <  1,  then 


Fx{x)  = 

2 

A 

J_ 

2A' 


for  i'<x<A+B+C 


\/4Ax+A, 

I  ^(n/4Ax+A+2A+b),  for  A+B+C<x<C. 


(59) 


6)  If  /X  >  1,  then 

Fx{x)  =  ^\/4Aa;  +  A  +  2^  +  , 

for  >1  +  B  +  07  <  X  <  (7.  (60) 
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Asynchronous  Multiple-Access  Interference 
Suppression  and  Chip  Waveform  Selection 
with  Aperiodic  Random  Sequences 

Tan  F.  Wong,  Tat  M.  Lok,  and  James  S.  Lehnert 


Abstract— K  linear  decentralized  receiver  capable  of  suppress¬ 
ing  multiple-access  interference  (MAI)  for  asynchronous  direct- 
sequence  code-division  multiple-access  (DS-CDMA)  systems  with 
aperiodic  random  signature  sequences  is  proposed.  Performance 
bounds  on  this  receiver  are  also  obtained.  Using  them  as  per¬ 
formance  measures,  the  problem  of  chip  waveform  selection  in 
DS-CDMA  systems  with  the  proposed  receiver  under  the  near-far 
scenario  is  investigated.  In  particular,  the  performance  of  several 
practical  chip  waveforms  is  compared.  An  LMS-type  adaptive 
algorithm  is  developed  to  obtain  the  parameters  needed  in  the 
receiver,  which  only  requires  the  signature  sequence  and  coarse 
timing  information  of  the  desired  user. 

Index  Terms — Adaptive  signal  detection,  chip  waveform,  code 
division  multiaccess,  interference  suppression,  spread  spectrum. 


I.  Introduction 

Many  existing  multiuser  detection  methods  (see  [1] 
and  [2]  for  a  survey  of  the  literature)  for  direct- 
sequence  code-division  multiple-access  (DS-CDMA)  systems 
are  developed  under  the  periodic  sequence  assumption,  in 
which  the  signature  sequences  employed  to  spectrally  spread 
the  data  sequences  are  periodic  with  periods  equal  to  one  or, 
at  most,  a  few  data  symbol  durations.  In  practice,  there  are 
systems,  such  as  IS-95  [3],  which  employ  signature  sequences 
with  very  long  periods.  In  fact,  in  IS-95  [3],  the  period  of 
the  signature  sequences  is  so  long  that  the  sequences  can 
be  regarded  as  aperiodic.  For  systems  with  aperiodic  (or 
very  long)  signature  sequences,  the  aforementioned  multiuser 
detection  methods  may  not  work  well. 

In  this  paper,  we  examine  the  problem  of  multiuser  com¬ 
munications  via  the  single-user  detection  point  of  view,  i.e., 
we  treat  signals  from  other  users  as  interference.  To  this  end, 
we  investigate  multiple-access  interference  (MAI)  suppres¬ 
sion  for  DS-CDMA  systems  with  random  aperiodic  signature 
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sequences.  Moreover,  we  ussume  th^t  users  in  (he  system  com¬ 
municate  in  an  asynchronous  manner.  Details  of  the  system 
model  are  given  in  Section  II.  In  Section  III,  we  develop  a 
linear  minimum-mean-squared  error  (MMSE)  receiver,  which 
is  closely  related  to  the  receivers  in  [4]-[6]  (see  Section  III 
for  a  discussion  on  the  similarities  and  differences).  The 
linear  MMSE  receiver  changes  every  symbol  period  due  to 
the  fact  that  the  signature  sequences  are  aperiodic.  To  reduce 
complexity,  we  simplify  the  linear  MMSE  receiver  to  make 
it  take  the  form  of  conventional  matched  filter  followed  by 
a  continuous-time  correlator,  which  is  time-invariant.  Aiming 
further  toward  a  practical  implementation,  we  approximate  the 
continuous-time  correlator  in  the  receiver  by  q  discrete-time 
correlator.  The  resulting  receiver  turns  out  to  be  a  special  case 
of  the  one  considered  in  [7].  Because  of  this,  the  development 
in  Section  III  can  be  viewed  as  a  justification  for  the  receiving 
structures  in  [7]. 

The  key  advantage  of  this  receiver  is  its  implementational 
simplicity.  We  will  demonstrate  in  Section  V  that  it  is  practi¬ 
cally  sufficient  to  sample  the  output  of  the  matched  filter  a  few 
times  and  pass  the  samples  to  the  discrete-time  correlator.  The 
major  complexity  burden  of  the  receiver  lies  in  determining 
the  weights  in  the  correlator.  Therefore,  the  smaller  the  number 
of  samples  taken,  the  simpler  is  the  receiver.  Second,  adaptive 
algorithms  can  be  employed  to  determine  correlator  weights. 
In  Section  VI,  we  develop  an  LMS-type  adaptive  algorithm 
to  do  the  job. 

Unlike  the  abundant  amount  of  work  on  MAI  rejection, 
there  is  much  less  work  concerning  the  problem  of  chip  wave¬ 
form  selection  in  DS-CDMA  systems.  The  chip  waveform 
selection  problem  for  the  conventional  matched  filter  receiver 
is  investigated  in  [8H10].  The  problem  of  chip  waveform 
selection  with  the  noise-whitening  matched  filter  receiver  is 
also  addressed  in  [4]  and  [5].  In  this  paper,  we  investigate 
the  chip  waveform  selection  problem  for  the  MMSE  receiver 
proposed  under  the  near-far  scenario.  As  a  starting  point,  we 
define  two  performance  measures  to  evaluate  the  performance 
of  the  receiver,  namely,  the  signal-to-noise  ratio  (SNR)  and  the 
near-far  efficiency  (NFE),  in  Section  IV.  The  former  measures 
the  average  SNR  in  the  decision  .statistic  given  out  by  the 
receiver.  The  latter  indicates  the  “average  robustness”  of  the 
receiver  in  combating  MAI.  We  also  develop  performance 
bounds  on  the  receiver  proposed  by  considering  the  simple 
case  of  a  system  with  just  a  single  interferer.  Using  these 
bounds,  we  examine  the  effect  of  the  chip  waveform  on 
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the  performance  of  the  receiver  considering  factors  such 
as  bandwidth  efficiency,  MAI  suppression  capability,  and 
robustness.  In  particular,  we  consider  several  common  sample 
chip  waveforms  and  evaluate  their  performance  with  the 
receiver  proposed.  We  see  that  the  results,  although  sharing 
some  similarities  with,  are  significantly  different  from  those 
in  [4],  [5],  [9].  and  [10],  These  similarities  and  differences  are 
explained  in  Section  V. 

II.  System  Model 

In  this  section,  we  describe  the  model  of  the  DS-CDMA 
system.  We  assume  that  there  are  K  +  1  simultaneous  users  in 
the  system.  The  fcth  user,  for  0  <  fc  <  if,  generates  a  sequence 
of  data  symbols  bfK  We  assume  that  the  data  symbols  bj 

are  random  variables  with  =  1. 

The  fcth  user  is  provided  a  randomly  generated  signature 

sequence  given  by 

=  (1) 


where  y{t)  and  n{t)  represent  the  desired  signal  and  the 
interference  components,  respectively.  The  desired  signal  con¬ 
tribution  y{t)  is  given  by 

cx> 

=  E  bt'hf'v-ny  (5) 

l=— OO 

The  interference  n{t)  is  the  sum  of  the  AWGN  component 
nw{t)  and  the  MAI  contribution  n/(t),  which  is  given  by 

K 

*;=!  i=-oo 

In  (5)  and  (6),  the  ith  symbol  waveform  of  the  fcth 

interferer,  is  given  by 

(()  =  E  o) 

3=0 

and  (pk  =  {oJcTk  +  9k)  models  the  phase  angle  of  the  fcth 
user’s  signal. 


The  elements  are  independent  random  variables  such  that 
=  0  and  =  1.‘  The  sequence 

is  used  to  generate  the  spectrally  spread  signal  given  by 

«(*)=  E  ''' 

CO 

where  Tc  is  the  chip  separation  and  is  the  common 
chip  waveform  for  all  signals.  The  symbol  duration  is  T  — 
iVTc,  where  N  is  the  number  of  chips  per  symbol  interval 
(the  processing  gain).  The  chip  waveform  is  assumed 
to  satisfy  the  Nyquist  criterion  to  minimize  the  interchip 
interference  and  is  normalized  so  that 
Moreover,  we  assume  that  the  signature  sequence  of  any  user 
is  independent  of  all  other  signature  sequences  and  all  the 
data  symbol  sequences. 

The  transmitted  signal  for  the  fcth  user,  for  0  <  fc  <  if , 
can  be  expressed  as 

Re[\/2^afc(f-Tfc)  exp(ja;c(f-Tfc))]  (3) 

where  Pk  is  the  power  for  the  fcth  signal,  Wc  is  the  earner 
frequency,  and  Tk  is  the  delay  that  models  the  asynchronous 
system.  We  show  later  that  it  suffices  to  assume  Tk  €  [0,  Tc), 
for  0  <  k  <  K. 

Without  loss  of  generality,  we  consider  the  signal  from  die 
zeroth  user  as  the  communicating  signal  and  the  signals  from 
all  other  users  as  interfering  signals  throughout  the  paper. 
We  assume  that  the  communication  channel  is  corrupted  by 
additive  white  Gaussian  noise  (AWGN)  with  two-sided  power 
spectral  density  Wo/2. 

Then  the  signal  received  at  the  intended  receiver  in  complex 
baseband  notation  is  given  by 

r{t)  =  y{t)  -h  n{t)  (4) 

'  BPSK  and  QPSK  spreading.?  are  examples  under  this  general  model. 


in.  Receiving  Structure 

We  assume  that  we  have  achieved  synchronization  with  the 
desired  user  signal.  Hence,  we  ma}'  assume  To  =  0.  Later, 
we  release  this  perfect  synchronization  assumption  to  model 
timing  mismatch  in  the  system. 

First,  we  consider  the  linear  MMSE  estimator  of  bj  ^  given 

the  sequence 

Sf  =  r  g)mt)dt.  (8) 

J  —OO 

By  the  orthogonality  principle  [11],  gj{t)  is  given  by  the 
solution  of  the  following  integral  equation: 

roo 

Eo^bf^*r{t)\=  J  Eo[r{t)r*{s)]gj{s)ds  (9) 

where  Eo[‘]  denotes  the  conditional  expectation  given  We 

consider  the  form  of  solution,  assuming  it  exists,  defined  by 

/OO 

gj{s)hf\t  -  s)  ds  (10) 

-OO 

and  let 

/OO 

r{s)h^j^"is  -t)ds.  (11) 

-OO 

Then 

/OO 

5*(i)f,(f)dt  (12) 

-OO 

where  gj{t)  is  the  solution  of 

Eo[bf^*fj{t)\  =  J  Eo[fjit)fj{s)]gj{s)ds.  (13) 

Physically,  it  means  that  we  take  observations  after  sequence- 
matched  filtering  (despreading). 
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ro) 

Fig.  1.  Linear  receiver  for  the  jth  data  symbol. 

At  the  receiver,  we  assume  the  signature  sequence  of  the 
desired  user  is  available.  Hence.  is  known.  From  (13), 

we  see  that  gj(s)  changes  with  j.  This  means  that  we  have  to 
solve  (13)  each  time  for  a  new  symbol.  It  is  easy  to  see  that 
the  expectation  [with  respect  to  ]  of  each  of  the  conditional 
expectations  in  (13)  is  not  a  function  of  j.  In  order  to  reduce 
the  computational  complexity,  we  average  over  all  the  desired 
user  sequences^  and  consider  the  following  linear  estimator  of 


(14) 


(15) 


Fig.  2.  Simplified  linear  receiver  for  the  jlh  data  symbol. 


and 


s)  = 


(21) 


/oo 

-oo 

The  filter  g{t)  in  (14)  is  the  solution  of 

tp{t)=  f  Rfit,  s)g{s)ds 

where  ^(t)  =  E[bf^*fj{t)]  and  Rf{t,  s)  =  E[fj{t)f*j{s)]. 
The  corresponding  receiver  is  shown  in  Fig.  1.  We  notice  that 
only  the  matched  filter  which  we  know  explicitly, 

changes  every  symbol  period.  Other  parts  of  the  receiver 
remain  fixed.  Of  course,  this  receiver  is  suboptimal  compared 
to  the  linear  MMSE  receiver  described  above. 

Since  g{t)  does  not  depend  on  j,  we  consider,  without  loss 

of  generality,  the  detection  of  the  symbol  6o 
onward.  From  (4)  to  (7)  and  (11),  we  obtain 

fo{t)  =  y(t)  +  «(*) 

where  n{t)  =  n/(t)  +  nwit)  with  the  despread  components 
indicated  by  tite  tilde  symbol.  We  can  easily  evaluate  the  terms 
needed  in  order  to  solve  (15) 

^{t)  =  (17) 

and 

Rr{t,  S)  =  Ryi^i  S)  +  Rn{t,  s)  (18) 

where^  Rn{t,  s)  =  Rhri^^ 

Ryi^i  5) 

+  ^  ^  ^{t-jTcW{s-jTc),  (19) 


J=-oo 


Rhiit,  s) 


The  function  t^(  )  is  the  autocorrelation  of  the  chip  waveform 
defined  by 

=  ^  r  i’{s)r{s-t)ds.  (22) 

•f  C  J  — OO 

To  satisfy  the  Nyquist  criterion,  we  need  to  have  ^(0)  =  1 

and  '^{jTc)  =  0  for  j  ... 

Above,  we  have  implicitly  assumed  that  the  sums  indexed 
by  j  in  (19)  and  (20)  converge.  A  sufficient  condition  for  this  is 
that  dt  <  00,  which  we  will  assume  throughout  the 

rest  of  Sis  paper.  Therefore,  when  N  is  large,  we  can  neglect 
[7]  the  effect  of  the  interchip  interference  terms  (the  sums 
indexed  by  j)  in  (19)  and  approximate  Ry{t,  s)  by 
Also  it  is  obvious  from  (20)  that  we  can  assume  Ti,  ••• ,  Tk 
lie  in  the  interval  [0,  Tc).  since  hi{t)  is  cyclostationary  with 
period  Tc.  With  perfect  knowledge  of  {Ti,  •  •  • ,  Tr-},  we  can 
solve  the  integral  equation  (15),  provided  a  solution  exists.  We 
notice  that  the  values  of  {4>o,  4>u  ■  ■  ■ ,  <Pk}  do  not  affect  the 

solution.'*  .  J  • 

However,  we  seldom  have  this  knowledge  beforehand  in 
practice.  Moreover,  the  number  of  interferers  K  and  Tfc 
may  vary  slowly  in  time.  As  a  result,  the  continuous-time 
receiver  described,  though  simple,  is  hardly  of  any  practical 
interest.  We  observe  that  the  matched  filter  part  of  the  receiver 
is  known.  Hence,  the  major  difficulty  of  implementing  the 
receiver  is  to  find  g^t),  the  solution  of  (15).  As  a  first  step  to 
improve  the  practical  applicability  of  the  receiver,  we  impose 
two  simplifications:  observe  the  output  of  the  matched  filter 
for  a  finite  duration  and  sample  and  weight  the  output 
of  performing  the  continuous-time  correlation.  The  simplifi^ 
receiver  is  basically  a  special  case  of  the  one  considered  in 

[7],  which  is  shown  in  Fig.  2. 

More  precisely,  if  we  sample  the  output  of  the  matched  mter 
every  T,  seconds  at  jT,  +  7b  and  let  [g],-  =  T,g{jTs  +  To). 
(R,b  R,liT.  +  To,  jT.  +  To),  Ifob  =  foOT.  +  To),  and 
Wj  =  +  To),  then  (14)  and  (15)  become 

i 


(23) 


^  Tp{t-jTc-Tk)r{s-jTc-n) 


and 


fc=l  j=-oo 


]i  = 


(24) 


(20) 

^This  approximation  is  similar  to  the  single-chip  filter  in  [5]. 

Jpor  BPSK  spreading,  an  extra  term  jr  ~  + 

-  jTr)  should  be  added  to  the  expression  in  (19). 


“For  QPSK  spreading,  the  phase  information  has  no  . 

cancellation.  However,  this  is  not  tnie  for  the  ca.se  of 
we  neglect  the  phase  information  and  sacrifice  a  little  performance  gai 

generality. 
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where  the  summations  range  over  the  finite  number  of  samples 
taken.^  Above,  the  offset  To  is  introduced  to  model  the  timing 
error  in  synchronizing  the  desired  user  signal.  Usually,  To  is 
small  compared  to  T^,  but  we  do  not  know  its  value.  Here, 
we  assume  that  To  6  [— Tf;/2,  T(./2).^  We  develop  an  adaptive 
algorithm  in  Section  VI  to  solve  (24)  without  any  knowledge 
of  {To,Ti....,T/,'}. 

We  note  that  (24)  always  has  a  unique  solution,  as  the  matrix 
Rn  [the  sample  version  of  Rn{t.  cS*)]  is  positive  definite  under 
the  assumptions  stated  before.  In  fact,  let  Qn  be  the  inverse 
of  Rii.  Then,  by  the  matrix  inversion  lemma,^ 

'  J 

where 

i  3 

As  mentioned  in  Section  I,  the  MMSE  receiver  is  closely 
related  to  the  receivers  in  [4]~[6].  Here,  we  discuss  the 
differences  between  them  in  detail.  The  major  difference 
between  [4],  [5],  and  the  above  MMSE  receiver  is  in  the 
way  in  which  the  delays  of  the  interferers  are  modeled.  In 

[4]  and  [5],  these  delays  are  uniformly  distributed  random 
variables.  In  our  development,  they  are  treated  as  deterministic 
constants.®  These  two  different  models  lead  to  many  different 
interpretations  and  results.  For  example,  the  model  in  [4]  and 

[5]  implies  that  the  MAI  is  stationary  Gaussian.  This  is  a 
reasonable  approximation  for  the  cases  considered  in  [4]  and 
[5],  where  Aere  are  a  large  number  of  equal-power  users. 
However,  this  assumption  is  not  accurate  for  the  near-far 
situations  (there  are  a  few  strong  interferers)  considered  in 
this  paper.  By  conditioning  on  the  delays  of  the  interferers,  we 
know  [12]  that  it  is  reasonable  again  to  approximate  the  MAI 
(after  the  despreading)  as  Gaussian,  with  which  MMSE  and 
maximum  SNR  are  reasonable  optimization  criteria.  However, 
the  MAI  is  cyclostationary,  instead  of  stationary,  now.  The 
noise-whitening  development  in  [4]  cannot  be  applied  directly, 
although  a  similar  development  can  be  obtained  [6]. 

When  there  are  a  large  number  of  interferers  with  equal 
powers,  (20)  approaches  (17)  in  [5]  by  the  law  of  large 
numbers.  Loosely  speaking,  our  model  reduces  to  the  one 
in  [4]  and  [5]  by  averaging  out  the  delay  information,  and 
our  receiver  reduces  to  the  single-chip  filter  suggested  in  [5]. 
However,  whether  this  averaging  is  a  reasonable  thing  to  do 
depends  on  the  channel  conditions.  In  fact,  we  will  show  that 
no  MAI  suppression  is  possible  when  all  the  interferers  are 
chip-synchronous  with  the  desired  user.  Hence,  the  noise¬ 
whitening  receiver  in  [4]  and  [5]  may  degrade  the  SNR 
performance  when  there  are  only  a  few  strong  interferers.  On 

^  For  convenience,  we  may  index  matrices  with  negative  indexes. 

^There  can  be  several  possible  interpretations  for  this  assumption.  For 
example,  it  models  the  case  in  which  we  can  only  establish  a  coarse 
.synchronization. 

^Assuming  all  the  required  invertibilities,  (A  —  BDC)”'  =  A”'  -f 
A-‘B(D-‘  -CA-'B)-'CA-‘. 

*  Equivalently,  the  development  in  Section  III  can  be  interpreted  as  condi¬ 
tioning  on  these  delays. 


the  other  hand,  our  model  (delays  are  deterministic)  is  accurate 
for  near-far  cases.  Based  on  this,  we  believe  that  our  receiver 
is  more  suitable  than  the  receiver  in  [4]  and  [5]  in  near-far 
situations. 

The  model  assumed  in  [6]  is  a  mixture  of  our  model  and 
the  model  in  [4]  and  [5].  The  locked  users  in  [6]  correspond  to 
modeling  the  delays  as  deterministic  constants,  while  the  un¬ 
locked  users  [6]  correspond  to  modeling  the  delays  as  uniform 
random  variables.  In  this  sense,  our  model  can  be  viewed  as 
a  special  case  of  the  model  in  [6].  The  approach  employed  in 

[6]  is  similar  to  that  used  in  [4],  namely  maximizing  the  SNR 
and  solving  the  corresponding  integral  equation.  The  powers 
and  delays  of  the  interferers  are  needed  in  order  to  solve  the 
required  integral  equation.  Although  the  results  in  [6]  can  be 
generalized  for  aperiodic  sequences  (periodic  sequences  are 
assumed),  a  different  integral  equation  must  be  solved  for 
every  symbol  and  adaptive  implementation  is  difficult.  By 
sampling  and  vectorizing  the  output  of  the  despreader  per 
symbol  as  described  above,  the  noise  process,  when  averaged 
over  all  desired  user  signature  sequences  and  viewed  as  a 
vector-valued  process,  becomes  stationary,  which  is  important 
for  adaptive  implementations.  Our  result  can  be  viewed  as 
a  practical  extension  of  the  receiver  in  [6]  for  aperiodic 
sequences. 


IV.  Performance  Measure 


To  evaluate  the  effectiveness  of  the  receiver  under  various 
conditions,  we  define  a  performance  measure  based  on  the 
average  signal-to-noise  ratio  in  the  decision  statistic  at  the 
output  of  the  receiver 

2 


It  can  be  shown  [7]  that  the  solution  of  (24)  maximizes  the 
SNR  defined  above  over  all  possible  choices  of  g  ^  0. 
Moreover,  we  can  show  (see  Appendix  A)  that  SNR  <  70, 
where  70  =  PqT/Nq  is  exactly  the  SNR  when  there  is  no 
interferer  in  the  perfectly  synchronized  (To  =  0)  system. 

The  “near-far  resistance”  defined  in  [1]  and  [13]  which 
measures  the  robustness  of  Ihe  receiver  in  a  multiple-access 
environment  under  the  periodic;  deterministic  sequence  as¬ 
sumption  is  not  applicable  in  our  case.  Since  we  consider 
the  average  performance  when  we  develop  our  receiver,  it  is 
reasonable  to  have  a  performance  measure  which  indicates  the 
“average  robustness.”  In  light  of  this,  we  define  the  “near-far 
efficiency”  (NFE)  as 


NFE  =  inf{SNR/7o:  ft  >  0,  ^  -  1,  2,  •  •  • ,  AT}  (28) 


where  SNR  is  defined  in  (27).  We  note  that  0  <  NFE  <  1. 
Moreover,  samples  of  ro(^)  exhibit  Gaussian-like  behavior 
when  N  is  large  [12].  This  also  justifies  the  use  of  SNR. 

We  see  that  the  SNR  and  NFE,  defined  above,  are  functions 
of  To,  Ti,  •  •  • ,  T/c.  Sometimes,  it  is  convenient  to  have  per¬ 
formance  measures  that  are  independent  of  them.  To  do  this. 
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we  model  the  delays  as  uniform  random  variables  on  [0,  Tc)^ 
and  define  the  “average  signal-to-noise  ratio”  and  average 
near-far  efficiency”  as 

ASNR  =  Tk  [SNR]  (29) 


and 


ANFE  =  Tk  [NFE]  (30) 


where  £^To,t,,  -,TkI-]  denotes  that  expectation  is  with  respect 
to  To,  Ti,  •••,  Tie- 

The  general  expression  of  the  SNR  is  too  complex  to  give 
us  any  insight  on  the  design  of  the  receiver.  Instead,  we  study 
in  detail  the  simple  system  with  two  users,  namely,  a  desired 
user  and  an  interferer.  It  turns  out  that  the  expression  for  the 
least  upper  bound  on  the  SNR  for  this  case  is  clear  and  simple 
and  can  be  used  as  a  guide  in  designing  the  receiver. 

First,  define  the  function 


SNRoo(Ti)  =  70 


Ai 

1  +  Ai 


^  |^(/Tc  +  Ti)|2 

l=z-00 


(31) 


where  70  =  PoT/No  and  Ai  =  PiT/{NNo).  It  is  shown 
in  Appendix  A  that  SNRoo  is  the  least  upper  bound  on  the 
SNR  achievable  by  any  discrete-time  receiver  configuration 
described  in  Section  HI,  even  if  the  sampling  times  are  not 
uniform.  We  note  that  SNRoo  is  not  a  function  of  Tq.  This 
means  that  the  effect  of  the  timing  error  can  be  neglected  if 
we  sample  fine  and  long  enough.’®  From  (31),  it  is  obvious 
the  function  defined  as 


NFEoo(Ti)  =  1  -  E 

l=—oo 

is  the  least  upper  bound  on  the  NFE  of  any  receiver  under  the 
single-interferer  assumption.  Similarly,  we  can  see  that 


ASNRoo  -  70  1  ^  J  ^  ^  (/) 


(33) 


and 

1  . 

ANFEoo  =l-7f7  ^^(/) 

J-C  J-oo 

are  the  least  upper  bounds  on  the  ASNR  and  ANFE  of  any 
receiver  under  the  single-interferer  assumption.  In  (33)  and 
(34),^(/)  is  the  Fourier  transform  of 

pTnally,  we  point  out  that  the  performance  of  the  system  will 
be  worse  if  more  and  more  interferers  are  added  to  the  system. 
It  can  be  shown,  by  augmenting  the  arguments  in  Appendix 
A,  that  (31)-(34)  provide  upper  bounds  on  corresponding 
measures  of  any  receiver  under  all  system  conditions  in  this 
paper. 


TABLE  I 

Bandwidths  (in  1/Tr  Hz)  and  ANFE„o  of  Different  Chip  Waveform.? 


Cliip  waveform 

(a) 

(b) 

(c) 

(d) 

(e) 

0.99-BW 

20.57 

2.364 

2.818 

1.996 

4.953 

Noise  BW 

1 

1.234 

1.5 

2 

5 

Half- power  BVV 

0-8859 

1.189 

1.441 

2 

5 

ANFEoo 

0.3333 

0.4134 

0.5189 

0.5062 

0.8010 

Bound  on  aN^Eoo 

'(for  0.99-BW  above) 

0.9524 

0.5853 

0.6522 

0.5091 

0.8021 

V.  Chip  Waveform  Selection 

From  the  previous  section,  we  know  that  if  we  sample 
fine  and  long  enough,  the  selection  of  the  chip  waveform  is 
the  only  factor  affecting  the  performance  of  the  system  that 
we  are  able  to  control.  The  effect  of  the  chip  waveform  on 
the  performance  of  the  system  can  be  large.  However,  it  is 
generally  difficult  to  evaluate  this  effect,  as  the  dependence 
of  the  system  performance  on  the  chip  waveform  is  usually 
very  complicated.  Since  ASNRoo  3^*1  ANFEoo  provide  upper 
bounds  on  the  performance  of  any  receiver  and  they  depend 
on  the  chip  waveform  in  a  simple  and  clear  fashion,  we  can 
use  them  to  measure  the  effectiveness  of  a  chip  waveform. 

In  this  section,  we  consider  five  sample  chip  waveforms 
and  evaluate  the  performance  of  various  receivers  employing 
them.  The  five  chip  waveforms  are:  a)  the  rectangular  pulse, 
b)  the  half-sine  pulse,  c)  the  time-domain  raised-cosine  pulse, 

d)  the  frequency-domain  raised-cosine  (2,  0.1)  waveform,  and 

e)  the  frequency-domain  raised-cosine  (5,  0.1)  waveform  (see 
Appendix  B).  We  notice  that  a)-c)  are  time-limited,  while  d) 
and  e)  are  band-limited.  Their  chip  waveform  functions  are 
given  [for  d)  and  e),  the  autocorrelation  functions  of  the  chip 
waveforms  are  given  instead],  respectively,  as 

V’a(0  -VtSP) 

ipb{t)  =  V2  sin(7rf/Tc )pTe(0 

ipcit)  =  V^[l  +  cos(27r(f/Tc  -  l/2))]pr.(f) 

7  sin(27rf/Tc)  cos(a-f/10Tc) 

"  27rf/Tc  1  r-  f2/25T2 

j  sin(57rf/Tc)  cos{irt/10Tc) 

“  STTf/Tc  1  -  f2/25T2 

where  pT.(f)  =  1  for  0  <  f  <  T^,  and  ptS*)  =  0’  otherwise. 

The  NFEoo’s  of  chip  waveforms  a)-e)  are  plotted  m  Fig.  3 
as  a  function  of  Ti .  Their  ANFE^,  ’  s  are  given  in  Table  I.  From 
Fig.  3,  we  see  that  there  is  no  gain  when  Ti  =  0,  i.e.,  when  the 
desired  user  is  chip-synchronous  to  the  interferer.  However,  as 
the  asynchrony  between  them  increases,  the  NFE  is  larger  than 
zero,  which  implies  that  the  receiver  is  resistant  to  the  near-far 
problem.  On  the  average,  we  see  that  the  frequency-domain 
raised-cosine  waveform  e)  and  the  rectangular  pulse  a)  give 
the  largest  and  the  smallest  ANFEoo,  respectively,  among  the 
five  chip  waveforms. 


’Here,  we  model  the  delays  as  random  variables  only  to  get  simpler 
performance  measures.  The  optimal  filtering  is  still  performed  under  the 
assumption  of  deterministic  delays. 

*®This  property  is  similar  to  the  timing-error-resistance  property  of  a 
fractionally  spaced  equalizer. 


A.  Bandwidth  Consideration 

From  (32).  for  any  e  >  0,  we  can  choose  a  chip  waveform 
such  that  NFEoo(Ti)  =  1  for  all  Ti  €  [e,  -  e].  Hence, 
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we  can  make  ANFEoc  arbitrarily  close  to  one  by  choosing 
suitable  chip  waveforms.  This  is  very  desirable  in  terms 
of  MAI  suppression.  However,  in  order  to  make  ANFEqo 
smaller,  we  have  to  increase  the  system  bandwidth,  which 
is  undesirable  in  terms  of  bandwidth  efficiency.  Since  no 
notion  of  bandwidth  is  employed  in  defining  ANFEoo  and 
ASNRoo»  it  is  not  very  meaningful  to  directly  compare  chip 
waveforms  using  these  two  performance  measures  unless  we 
restrict  ourselves  to  a  set  of  chip  waveforms  with  equal 
bandwidths.  To  do  so,  we  need  to  define  the  bandwidth  of 
a  DS-CDMA  system.  The  bandwidth  of  the  power  spectral 
density  of  the  communicating  signal  is  usually  taken  as  the 
bandwidth  of  a  digital  communication  system.  Based  on  the 
assumptions  we  made,  it  is  not  hard  to  see  the  power  spectral 
density  of  the  multiple-access  signal  is  just  a  scaled  version  of 
$(/).  For  a  power  spectral  density  which  is  not  band-limited, 
there  is  no  unique  definition  of  bandwidth.  In  fact,  there 
can  be  several  different,  but  meaningful,  definitions  [15].  For 
our  cases,  Table  I  summarizes  several  different  bandwidths  of 
systems  employing  the  five  chip  waveforms  described  above. 

For  simplicity,  we  pick  the  a-fractional  containment  band¬ 
width  (a-BW)  [15]  as  our  working  definition  of  bandwidth.  We 
can  derive  a  bound  on  ANFEoo  given  the  a-BW  of  a  system. 
Suppose  the  a-BW  of  the  system  is  Wa,  i.e.,  a  fraction  of 
all  the  signal  power  is  contained  inside  [-VFa/2,  W(^/2].  We 
have 

ANFEc.  <  1  -  (35) 

We  note  that  the  lowest  possible  value  of  is  n^lTc,  since 
there  is  no  chip  waveform  with  Wa  <  a^/Tc  that  satisfies  the 
Nyquist  criterion.  The  proof  of  (35)  is  given  in  Appendix  B. 

The  bounds  for  the  waveforms  having  the  same  0.99-BW 
as  each  of  the  five  sample  chip  waveforms  are  also  given 


in  Table  I.  We  see  that  the  ANFEoo ’s  of  chip  waveforms  d) 
and  e)  are  very  close  to  their  corresponding  bounds.  This 
implies  that  these  chip  waveforms  are  close  to  the  optimal 
ones  in  the  sets  of  legitimate  chip  waveforms  with  the  given 
bandwidth  limitations.  On  the  other  hand,  the  ANFEoo  of  tho 
rectangular  pulse  is  much  smaller  than  the  bound  in  (35). 
Hence,  the  rectangular  pulse  is  far  from  the  optimal  one  in 
the  corresponding  set  of  chip  waveforms.  Optimization  of 
ANFEoo  under  the  a-BW  constraint  is  difficult  in  general. 
However,  under  some  special  cases,  we  can  employ  the 
frequency-domain  raised-cosine  waveforms  to  achieve  the 
bound  in  (35)  as  closely  as  desired.  A  detailed  discussion  of 
this  is  given  in  Appendix  B. 

We  mention  that  we  can  improve  the  performance  of  a 
system  by  suitably  choosing  a  chip  waveform  to  increase  the 
system  bandwidth.  A  chip  waveform  with  a  larger  bandwidth 
is  usually  more  localized  in  time.  With  such  a  chip  waveform, 
we  can  pack  more  chips  into  a  symbol.  It  is  natural  to  ask 
whether  it  is  better  to  increase  the  number  of  chips  per  symbol 
(decrease  the  chip  separation  Tc)  together  with  the  bandwidth. 
We  can  employ  the  bound  in  (35)  to  answer  this  question 
when  optimal  chip  waveforms  are  employed.  From  (35),  we 
see  that  the  ANFEoo  bound  increases  with  the  bandwidth-chip- 
separation  product  W^Tc.  Any  increase  in  the  product  W^Tc 
due  to  the  increased  bandwidth  will  be  offset  by  the  decrease 
in  the  chip  separation.  Therefore,  it  is  better  to  just  simply 
increase  the  bandwidth  while  keeping  the  number  of  chips  per 
symbol,  N,  fixed  in  near-far  cases.  We  point  out  that  there 
are  two  caveats  to  the  interpretation  above.  First,  N  has  to 
be  large  for  the  argument  to  be  valid.  Hence,  we  cannot,  say, 
reduce  N  to  one  (no  spreading)  in  order  to  get  all  the  advantage 
from  MAI  cancellation.  Second,  the  argument  is  valid  only  for 
near-far  cases.  When  there  are  a  large  number  of  interferers 
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TABLE  II 

ASNR  (IN  dB)  OF  Different  Sampling  Schemes 


Chip  waveform 

(a) 

(b) 

(c) 

(cl) 

(e) 

ASNR«= 

1.5..T35 

16.240 

17.201 

17.096 

19.050 

r.  =  7;,  [0,0] 

2.9096 

4.4554 

7.4020 

10.580 

15.051 

T.  =  Tc,  \-Tc,Tc] 

4.3126 

5.3997 

8.0678 

10.586 

15.054 

T,  =  Tc/2,\-Tc,Tc]  ^ 

13.6.63 

1.6.846 

16.598 

16.187 

17.067 

T.  =  Tc/4,  l-r./2,T./2] 

14.169 

16.207 

17.191 

16.703 

18.120 

T,  =  Tc/4,  [-Tc,Tc] 

14.692 

16.215 

17.197 

16.898 

18.522 

with  equal  powers,  it  is  shown  in  [4]  that  the  performance 
only  depends  on  the  bandwidth.  It  does  not  matter  whether  we 
increase  N  or  not. 


B.  Effect  of  Sampling 

ASNRoo  and  ANFEoo  provide  upper  bounds  on  the 
mance  of  a  system.  In  practice,  we  cannot  sample  too  finely 
or  for  too  long.  Therefore,  the  performance  of  the  system  will 
be  poorer  than  that  predicted  by  these  two  upper  bounds.  To 
give  some  indication  on  how  large  the  degradation  will  be,  we 
consider  five  different  sampling  schemes  for  systems  with  each 
of  the  five  sample  chip  waveforms.  Table  II  summarizes  the 
ASNR  of  each  of  the  sampling  schemes  for  each  of  the  cmp 
waveforms.  In  Table  II,  the  sampling  schemes  are  identified  by 
the  values  of  T*  and  the  observation  intervals.  To  single  out 
the  effect  of  sampling,  we  consider  a  system  with  N  =  127,  a 
single  interferer  whose  power  is  20  dB  above  the  desired  user, 
and  70  =  20  dB.  We  also  assume  perfect  synchronization  is 
achieved,  i.e..  To  =  0. 

The  first  row  of  Table  II  gives  the  ASNR  for  the  con¬ 
ventional  matched  filter  receiver.  We  observe  that  the  ASNR 
improves  greatly  for  chip  waveforms  a)-d),  even  if  we  just 
sample  twice  per  chip.  For  chip  waveforms  b)  and  c),  the 
ASNR  approaches  the  ASNRoo  bound  if  we  sample  four  times 
per  chip  and  have  an  observation  interval  of  %  seconds,  whic 
corresponds  to  just  five  samples. 


C.  Effect  of  Timing  Error 

Since  the  functions  SNRoo  and  NFEoo  do  not  depend  on  To, 
we  know  that  when  we  sample  fine  enough,  the  effect  of  timing 
error  is  minimal.  However,  when  the  sampling  rate  is  not 
high,  timing  error  can  degrade  the  performance  of  the  receiver 
significantly.  To  evaluate  this  effect,  we  reconsider  the  system 
in  Section  V-B,  except  that  To  is  modeled  as  a  uniforn.  random 
variable  on  [-T,/2,  T,/2).  Table  III  shows  the  ASNR  s  for 
several  different  sampling  schemes.  Comparing  the  results 
with  Table  II,  we  see  that  chip  waveforms  b)-d)  are  relatively 
insensitive  to  timing  error,  whereas  the  effect  of  timing  error 
is  significant  for  chip  waveforms  a)  and  e). 


TABLE  ill 

ASNR  (IN  dB)  OF  Different  Sampling  Schemes  with  Timing  Error 


Chip  waveform 

(a) 

(b) 

(c) 

(cl) 

(c) 

T,  =  Tc,  1-rc.Tcl 

1.6441 

2.9005 

4.8503 

7.4035 

7.99.35 

T.  =  Tcl2,  [-T:,T,1 

11.297 

15.312 

15.697 

15.7942 

12.934 

T.  =  7;/4.  [-Tc/2,Tc/^ 

11. ,346 

15.825 

16.942 

16.619  ^ 

16.803 

Ti  =  rjA,  l-Tc,Tc] 

13.602 

16.179^ 

17.189 

16.883 

17.421 

conventional  matched  filter  receiver  is  given  by 

lom)? 


SNR,„/(Ti)  = 


1-f  Ai  53  |V;(/T,-l-Ti)P 

l=-oo 


(36) 


It  is  obvious  from  (36)  that  the  conventional  matched  filter 
receiver  is  sensitive  to  timing  error  in  the  system.  Assuming 
To  =  0,  we  can  obtain  the  ANFE  of  this  receiver 


as 


ANFEm/ =  Y 


|T.e[0,T,):  f;  W(Te  +  TO|^  =  ol 

1  l=-oo  ^ 

(37) 

vhere  I  •  1  denotes  the  Lebesgue  measure  on  [0,  Tc).  It  is 
ihown  in  Appendix  A  that  ANFEoo  >  ANFEm/.  Hence, 
ve  see  that  good  (in  terms  of  large  ANFE)  chip  waveforms 
br  the  conventional  matched  filter  receiver  are  also  good  for 
he  MMSE  receiver  considered  in  Section  III.  However,  the 
'onverse  is  not  true,  as  we  can  see  that  all  five  sample  chip 
ivaveforms  give  zero  ANFE  for  the  matched  filter  receiver^ 
Effectively,  the  MMSE  receiver  acts  to  enlarge  the  set  ot 
chip  waveforms  which  are  robust  in  combating  the  "ear-far 
problem  so  that  more  practical  waveforms,  such  as  the  five 

sample  waveforms,  can  be  used. 

In  [9]  and  [10],  the  problem  of  chip  waveform  selection  for 
the  matched  filter  receiver  is  also  considered.  In  our  notation, 
these  references  use 

(38) 


STIR  = 


7o 


l-h 


h  r  4!^if)df 

■^c  J —oo 


5  the  performance  measure.  We  get  STIR  from  (36)  if  we 
To  =  0  average  just  the  denominator  over  i- 
oughly  speaking.  STIR  can  be  treated  as  an 
f  ASNRm/-  Comparing  (33)  and  (38),  we  stt  that  bo 
nd  ASNRoo  depend  on  the  same  factor  /_^  w  yj)  ]■ 
result,  chTp  waveforms  which  give  large  STIR  a  so  give 
irge  ASNRoo.  However.  STIR  tends  to  zero  as  ‘^rfering 
ower  becomes  very  large,  while  ASNRoo  , 

f  practical  chip  waveforms.  This,  again,  shows  ^vantage 
if  the  proposed  receiver  over  the  conventional  ma  c  e 
eceiver  in  terms  of  combating  the  near-far  problem. 


D.  Conventional  Matched  Filter  Receiver 

It  is  interesting  to  consider  the  problem  of  chip  waveform 
selection  for  the  conventional  matched  filter  receiver  under 
the  framework  developed  in  Section  IV.  The  SNR  for  the 


E.  Multiple  Interferers 

To  indicate  the  performance  of  the  receiver  when  there  are 
multiple  interferers  in  the  system,  we  obtain  the  ASNR 
receivers  with  chip  waveforms  a)  and  d).  Each  interfere 
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Fig.  4.  ASNR  versus  number  of  interferers. 


equal  power,  which  adds  up  in  such  a  way  that  the  total 
interference  power  is  10  dB  above  the  desired  user  power.  All 
other  system  parameters  are  the  same  as  those  in  Section  V- 
B.  The  results  shown  in  Fig.  4  indicate  that  the  gain,  over 
the  conventional  matched  filter  reduces  as  the  number  of 
interferers  increases. 

We  observe  from  Fig.  4  that  the  rectangular  pulse  a)  out¬ 
performs  chip  waveform  d)  when  there  are  a  large  number 
of  equal-power  interferers.  This  indicates  that  ANFEqo  is 
not  a  good  measure  of  performance  for  this  situation.  It  is 
mentioned  in  Section  III  that  the  MMSE  receiver  approaches 
the  single-chip  noise-whitening  receiver  in  [5]  when  there  are 
a  large  number  of  equal-power  interferers.  From  [4],  we  know 
that  the  single-chip  noise-whitening  filter  improves  the  SNR 
performance  of  a  chip  waveform  to  that  of  the  “flat-spectrum” 
signal  with  the  same  bandwidth.  When  the  noise-whitening 
filter  is  employed,  the  performance  depends  on  the  bandwidth 
only.  Since  the  rectangular  pulse  a)  has  a  larger  bandwidth 
than  chip  waveform  d),  according  to  [4],  the  rectangular  pulse 
should  outperform  chip  waveform  d)  when  there  are  a  large 
number  of  interferers.  However,  the  argument  in  [4]  does 
not  apply  when  there  are  only  a  few  strong  interferers  (i.e., 
when  MAI  is  best  modeled  by  a  cyclostationary  process).  In 
such  cases,  bandwidth  alone  is  not  sufficient  and  ANFEqo 
becomes  a  better  performance  measure.  From  Table  I,  a)  has 
a  smaller  ANFEqo  than  d)  and,  hence,  should  have  a  poorer 
SNR  performance  when  there  are  only  a  few  interferers.  This 
is  readily  observed  in  Fig.  4.  Summarizing  the  discussion,  our 
viewpoint  in  selecting  the  chip  waveform  differs  from  that  of 
[4]  and  [5]  in  the  sense  that  we  focus  on  the  near-far  scenario, 
while  [4]  and  [5]  assume  that  there  are  a  large  number  of  equal- 
power  interferers.  As  we  see  from  above,  the  two  different 
viewpoints  lead  to  very  different  conclusions. 


VI.  Adaptive  Implementation 


As  mentioned  in  Section  III,  the  MMSE  receiver  structure 
allows  adaptive  implementations.  In  this  section,  we  can 
construct  an  LMS-type  adaptive  algorithm  to  solve  (24). 

Suppose,  for  the  moment^  we^  know  the  value  of  To  =  0. 
Hence,  we  can  determine  ^/11^||,  since  it  does  not  depend 
on  Po-  For  the  detection  of  the  jth  symbol,  we  sample  the 
despread  received  signal  f{t)  every  Ts  seconds  as  described 
in  Section  III  to  form  the  vector  f  j  of  length,  say,  M.  Then, 
the  following  algorithm  can  be  used  to  solve  (24). 

Set  ao  =  0  and  for  j  >  0,  get  gj  by 


aj  =  Aaj-i  4-  f^f ^ 


(1  -  X)aj 


(39) 

(40) 


where  /x  is  the  step  size  and  0  <  A  <  1  is  chosen  to  smooth 
the  average  energy  estimation  of  rj. 

Strictly  speaking,  the  solution  given  by  this  algorithm  differs 
from  the  solution  of  (24)  by  a  complex  constant  that  has 
no  effect  on  the  SNR  of  the  system.  Because  of  the  same 
reason,  we  do  not  need  to  Icnow  the  phase  of  the  desired  user 
signal.  In  (39),  we  estimate  the  average  energy  of  the  vector 
rj.  The  update  in  (40)  is  the  standard  LMS  update  except 
that  the  observation  vector  f ^  is  normalized  by  its  estimated 
average  norm,  i.e.,  the  operation  of  a  perfect  automatic  gain 
control  (AGC)  is  performed.  This  normalization  is  necessary 
to  avoid  numerical  instability.  We  note  that  the  computational 
complexity  of  the  algorithm  is  of  the  order  of  M  per  iteration. 

If  we  do  not  know  the  value  of  To,  a  standard  strategy 
[14]  is  to  hypothesize  several  different  values  of  To  and  use 
them  to  obtain  different  ^  to  run  several  applications  of  the 
adaptive  algorithm  above  in  parallel.  We  select  the  one  that 
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Fig.  5.  SNR  from  a  sample  nm  of  the  adaptive  algonthm. 

maximizes  the  SNR.  This  requires  us 
from  the  received  signal.  Again,  due  to 
Rj  we  can  have  a  very  efficient  way  to  estimate  the  SNR. 
AftCT  the  sampling  operation  to  get  ij,  we  wait  for  severe 
chip  separations  and  sample  the  output  of  Ae 
M  times  again  to  form  another  vector  vj.  We  point  out  that 
the  time  separation  of  f,-  and  f,-  must  be  an  integral  multiple 
of  Tc  and  large  enough  so  that  almost  no  desired  user  signal 
IT^y  is  contained  in  f,.  THen.  the  following  measure  gives 
a  reasonable  estimate  of  SNR  -I-  1* 

^Ai-lgf(To)f,p 

(rp  \  _  — - - - - -  (41) 

-  j 
i=i 

In  (41)  the  dependence  on  To  is  emphasized,  and  0  <  A  <  1  is 
chLn  to  smooth  out  noise  in  the  measure.  Therefore  can 
simply  choose  the  one  that  maximizes  (Jj  instead  of  the  bN  . 
We  nL  that  aj  can  be  obtained  recursively  with  complexity 
of  order  M  per  iteration. 

To  demonstrate  the  algorithm,  we  consider  the  sample 
system  in  Section  V-B  with  binary  data  and  signature  se¬ 
quences,  the  time-domain  raised-cosine  pulse  c)  as  the  chip 
Taveform  To  =  T./4,  and  T,  =  T./2.  We  sample  twice  per 
Tp  with  the  observation  interval  [-T.,  T.].  We  run  three 
pLallel  applications  of  the  adaptive  algorithm  hypothesizing 

rp _ T  /4  0  and  Tr/4,  respectively.  The  SNR  and  aj  for 

Sch  of  £  .pplicatlons  resulting  front  a  run  of  the  algorithm 

are  shown  in  Figs,  5  and  6.  m  ,®4 

see  that  the  algorithm  with  hypothesis  To  -  TJ4  gives 
the  largest  after  the  algorithm  settles  down.  According 
lo  the  Lcussion  above,  we  can  deduce  from  this  result  that 


To  =  Tc/4.  Referring  back  to  Fig.  5,  we  see  that  the  algorithm 
with  hypothesis  To  =  Te/4  does  converge  to  give  a  go<^  S^R 
perfornwnce,  while  the  algorithms  with  other  hypotheses  give 
much  poorer  SNR  performances. 

VII.  Conclusions 

In  this  paper,  based  on  the  MMSE  principle,  we  have 
developed  a  linear  receiver  capable  of  suppressing  M 
asynchronous  DS-CDMA  systems  with 
seouences  Relying  only  on  the  signature  sequence  and  rough 
STnfoLtion  of 'the  desired  user,  we  have  deeped 
an  LMS-type  adaptive  algonthm  to  obtain  the  weig 
correlator  Moreover,  we  have  obtained  upper  bounds  on  the 
performance  of  this  receiver  based  on  the  special  case  of  a 
Lgle  interferer.  Using  these  upper  bounds  us  Pef 

meLures  we  have  investigated  some  considerations  such  as 
MAI  suppression  capability,  bandwidth  efficb"®/’  “tms  We 
Tess  in  selecting  chip  waveforms  for  DS-CDMA  systems.  We 
have’  found  out  Lt  the  criterion  for  selecting  chip  ^avefoms 
in  the  near-far  scenario  differs  significantly  from  that  for  th 
Cfwtem  them  are  a  large  uumbe,  of  equal-power  ...erferere, 

APPENDIX  A 

in  this  Appendix,  we  show  that  (31)  is  the  least  upper  bound 
on  the  SNR  of  any  receiver  described  in  Section 

:  %^S;te";SNR  of  dyadically  sampled  systems 
.  .ith  time-limited  VHO-  Assume  that  VKO  is 
;  [-LiTr,  L2T,]  with  (Li  >  0,  L2  >  0)-  Then, 
i  limited  to  [-LTr,  L%]  where  L  -  1  +  2-  _  ^ 

r  receiver  which  takes  samples  at  jT,  +  ■‘0  tor  j  v 
;  l)2''t  _(L-fl)2«t  +  l.  --,(L-t-l)2''.wherc7:u  =  ?./^  • 
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Fig.  6.  (Tj  from  a  sample  run  of  the  adaptive  algorithm. 


We  use  the  notation  SNRjw  to  denote  the  SNR  defined  in 
(27)  for  this  receiver.  Then,  we  claim  that,  for  To  =  ioTs  and 

Ti  = /iT„  where /o  G  + 

and  Zi  e  {0,  1,  •••,  2^  -  1} 


SNRm  =  70 


Ai 

1  +  Ai 


(42) 


To  show  this,  observe  that  since  the  chip  waveform  is  time- 
limited,  the  summation  indexed  by  j  in  (20)  has  only  a  finite 
number  of  nonzero  terms.  Then,  we  can  solve  (24)  recursively 
by  the  following  result,  which  is  based  on  repeat  applications 
of  the  matrix  inversion  lemma:  suppose  Ro  is  a  positive 
definite  matrix  Rj  =  Rj_i  -I-  for  Z  =  1,  2,  •  •  • ,  P, 
and  R  =  Rp  -I-  ■  Then  Rj  for  Z  =  0,  1,  •  •  • ,  P  and 

R  are  invertible.  Write  the  inverse  of  R;  as  Q;  where 
Z  =  0,  1,  •  •  • ,  P.  Let  Rg  =  7/  and  7j  =  Then,  for 

Z  =  1,2,  •••,P 


To  initialize  the  recursive  process  above,  we  set  [Roltj  = 
[Rnwltj-  Then,  we  have  the  following  important  equation: 

<«) 

for  i,j  =  -{L  +  1)2^,  •••,  (L  +  1)2*^.  By  (44),  we 
have  7o  =  PoT/Nq.  Next,  we  arrange  the  terms  in  (43) 
as  EiztU  =  +  and  apply  the 

recursive  procedure  on  them  according  to  this  ordering. 
Again,  by  (44) 

[gi]i  = 

Ai  =PiT/{NNo), 

7i=7o[l-3^|^(-iTe-bTi)|"  . 


g/  = 

A/  =^fgj 

1  H 

Qi  =  Q(-i  -  S'S' 

f  I  if  1^ 

71  =71-1  -  iY~Xi  ■ 

Consider  the  dyadic  receiver  described  ^bove  for  i,  j  = 

-(L  -1-^)2^,  (L  +  1)2^,  =  Mx+io.  [Rylii  = 

[t^'].+io[^li+io’  =  (\/2ZVo/\/^)[tA]i-j.  and 


[Ri 


2L+1 

,w=  E 


l=-2L-2 


NPo 


12^^  —li  +/o 


where  [t/iji  =  iJ>{iTs). 


j_12«-l,+lo 

(43) 


Inductively,  we  haye 

[g2/-]i  =  (V  Pi/2ZV  /No^Si^  (z,-i)2M+ii-io 
\2L=PiT/{NNo) 


72Z-  =70 


L-l 


/=-L 


Now,  consider  the  terms  in  the  second  and  third  summa¬ 
tions.  First 


i+(2L+2)2M-l,+lo‘ 


(45) 
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Due  to  the  fact  that  Qo  is  Hermitian  and  is  time-limited 
to  [-LTc,  LTc\.  it  is  easy  to  see  from  (44)  and  (45)  that 

j 

and  hence  72L+1  =  72L-  Inductively,  we  can  show  that,  for 
I  =  2L  +  1,  2L  +  2,  •  •  • ,  4L  +  4 

i 

Therefore,  we  have  74L+4  =  74L+3  =  •  •  •  =  72i.+i  =  72L- 
Finally,  we  see  from  (27)  that  SNR  =  741.+4  ^^d,  hence,  (42) 
is  established.  It  is  also  obvious  that  extending  the  observation 
interval  further  will  not  increase  SNRm- 

Next,  we  proceed  to  show  the  general  case.  In  order 
to  do  that,  we  need  the  following  obvious,  but  important, 
observation.  Suppose,  we  have  two  receivers,  say  Receivers 
A  and  B.  Both  are  constructed  as  in  Section  III,  except  that 
we  allow  the  sampling  times  to  be  nonuniform.  For  fixed  To 
and  Ti.  let  the  SNR’s,  defined  in  (27),  for  them  be  SNRa  and 
SNRb,  respectively.  If  the  sampling  times  of  Receiver  B  are 
a  subset  of  those  of  Receiver  A,  then  SNRa  >  SNRb  due  to 
the  fact  that  the  SNR  is  always  maximized  in  the  construction 
of  the  receiver. 

Moreover,  it  can  be  shown  easily  that  is  unifon^y 
continuous  on  the  real  line.  Let  ti,  tz,  •  •  • ,  fp  be  the  sainpling 
times  of  the  receiver  as  in  Section  HI.  By  the  fact  that  •^{t)  is 
continuous  and  Rfivv  ‘s  positive  definite  under  any  sampling 
configuration,  we  find  that  the  SNR,  defined  in  (27),  is  a 
continuous  function  of  ti,  t2,  ■  ■  • ,  tpy  To,  and  Ti. 

For  the  moment,  continue  to  assume  ip{t)  is  time-limited 
and  reconsider  the  dyadic  receivers  described  above.  For  fixed 
To  and  Ti,  SNRjw  is  an  increasing  sequence.  From  (42), 
SNRat  <  70-  Therefore,  the  sequence  must  have  a  limit.  For 
To/Tc  and  Ti/Tc  being  dyadic  rationals  in  [-1/2,  1/2]  and 
[0,  1],  respectively,  this  limit  is  given  by  (42).  Otherwise, 
since  dyadic  rationals  are  dense  in  both  [—1/2,  1/2]  and 
[0,  1],  and  SNRoo  and  SNRat,  for  all  M,  are  continuous 
functions  of  To  and  Ti,  SNRm  <  SNRoo,  for  all  M.  But 
since  SNRm.  for  all  M,  are  bounded  continuous  functions  of 
To  and  Ti,  limM-oo  SNRm  must  also  be  continuous.  This 
forces  limM-.oo  SNRm  =  SNRoo  for  all  To  and  Ti. 

Now,  let  {fi,  fiz,  •  •  • ,  tp}  be  the  set  of  sampling  times  of 
any  receiver  and  SNRo  be  the  corresponding  SNR.  Due  to  the 
fact  that  the  SNR  is  a  continuous  function  of  h,  t2,  ,  tp, 

there  must  be  a  receiver  with  sampling  times  t'l,  •  •  • ,  tp, 
which  when  divided  by  To  are  dyadic  rationals,  whose  SNR, 
namely  SNR',  is  arbitrarily  close  to  SNRo-  Then,  SNR'  < 
SNRm  <  SNRoo.  where  SNRm  is  the  SNR  of  the  dyadic 
receiver  whose  sampling  time  set  contains  {t'l,  ^2.  ■  ■  ■ .  tp}. 
This  implies  that  SNRo  ^  SNRoo- 


Summarizing  the  above,  we  have  established  the  least  upper 
bound  property  of  SNRoo  when  V^(t)  is  time-limited.  To  show 
this  for  the  general  case,  let  rpLit)  the  [—LTc,  LTc]  trunca¬ 
tion  of  ip{t).  Notice  that  we  scale  the  energy  of  ipUt)  so  that 
it  is  equal  to  Tc-  Then,  it  is  easy  to  show  that  i>L{t)  converges 
to  uniformly.  Moreover,  if  we  let  SNRo  be  the  SNR  with 
ip{t)  and  SNR^  be  the  SNR  with  rpiit),  limi_oo  SNR  = 
SNRo.  But  from  the  time-limit^  case,  SNR^  <  SNR^,  where 
SNR^  is  given  by  (31)  with  V’(/)  replaced  by  i>L{t).  It  is  also 
easy  to  show  that  lim/,_,oo  SNR^  =  SNRoo-  Hence,  by  letting 
00,  we  have  SNRo  <  SNRoo  <  7o-  On  the  other  hand, 
the  dyadic  receiver  gives  SNR  as  close  to  SNRoo  as  desired 
when  sampling  rate  increases. 


Appendix  B 

In  this  appendix,  we  discuss  the  achievability  of  the  bound 
in  (35). 

Suppose  the  a-BW  of  the  system  is  Wa-  By  the  Schwartz 
inequality,  df  >  We  can  get  (35)  by 

putting  this  inequality  into  (34).  Given  the  a-BW,  we  note 
that  equality  in  (35)  is  achieved  when  and  only  when  «’(/) 
is  constant  on  [-W„/2,  Wj2y  There  might  not  exist  chip 
waveforms  satisfying  this  condition  and  the  Nyquist  criterion 
simultaneously  for  all  values  of  a  and  Wa-  Hence,  the  bound 
in  (35)  might  not  be  always  achieved  and  the  optimization  of 
ANFEoo  is  difficult  in  general. 

However,  in  the  special  case  when  a  =  1  and  Wa  is  slightly 
larger  than  any  integral  multiple  of  1/Tc,  we  can  get  as  close  to 
the  bound  as  desired.  Consider  the  frequency-domain  raised- 
cosine  (n,  /0)  chip  waveform  defined  by  «'(/)  as  shown  in  (46) 
at  the  bottom  of  the  page  where  0  <  jS  <  1  and  n  is  an  mteger. 
Note  that  this  chip  waveform  satisfies  the  Nyquist  criterion  and 
is  band-limited  to  [-(n  -b  /3)/2Te,  (n  -b  /3)/2Tc].  It  is  easy  to 
see  that  ANFEoo  =  l-l/n-by9/4n2  and  (35)  gives  ANFE,c  < 
1  -  l/(n  -b  P).  We  achieve  equality  in  (35)  for  /?  =  0  and  any 
fixed  n.  Unfortunately,  the  autocorrelation  function  v{t)  of 
the  frequency-domain  raised-cosine  (n,  0)  waveform  does  not 
satisfy  the  r^uirement  that  dt  <  00,  which  implies 

that  it  is  not  a  legitimate  chip  waveform.  Nevertheless,  we 
can  get  as  close  to  the  bound  as  desired  by  choosing  .3  to  be 
small  enough.  As  a  consequence,  we  can  conclude  that  the 
frequency-domain  raised-cosine  (n,  0)  waveform  (for  small 
)8)  is  almost  optimal  under  the  constraint  that  the  l-B^'  of  the 
system  is  limited  to  (n  -b  I3)/Tc. 
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Performance  of  a  Type-II  Hybrid  ARQ  Protocol 
in  Slotted  DS-SSMA  Packet  Radio  Systems 

Qian  Zhang,  Tan  F.  Wong,  and  James  S.  Lehnert,  Senior  Member.  IEEE 


Abstract — In  this  paper,  the  application  of  a  type-II  hybrid 
ARQ  protocol  in  a  slotted  direct-sequence  spread-spectrum 
multiple-access  (DS-SSMA)  packet  radio  system  is  investigated. 
Both  the  static  performance  and  the  dynamic  performance 
of  such  a  system  are  analyzed.  In  the  physical  layer,  packet 
error  and  packet  success  probabilities  are  computed  using  the 
improved  Gaussian  approximation  technique,  which  accounts  for 
the  bit-to-bit  error  dependence  within  a  packet.  In  the  data-link 
layer,  two-dimensional  Markov  chains  are  employed  to  model 
the  system  dynamic.  Based  on  this  model,  the  performance  of 
the  type-II  hybrid  ARQ  protocol  is  upper  and  lower  bounded 
by  considering,  respectively,  a  superior  scheme  and  an  inferior 
scheme.  Steady  state  throughput  and  delay  performances  of  the 
two  bounding  schemes  are  obtained.  Moreover,  it  is  shown  that 
for  each  fixed  input  load,  there  is  an  optimal  retransmission 
probability  under  the  finite  user  population  assumption.  Bounds 
on  this  optimal  retransmission  probability  are  also  given. 

Index  Terms— Automatic  repeat  request,  code  division  multiac¬ 
cess,  Markov  processes,  networks,  packet  radio,  spread  spectrum 
communication. 


L  Introduction 

PURE  automatic  repeat  request  (ARQ),  type-I  hybrid  ARQ 
[1],  type-n  hybrid  ARQ  [2],  [3],  and  code  combining 
[4H6]  are  data-link  layer  protocols,  with  increasing  com¬ 
plexities,  which  try  to  provide  error-free  transmission  for 
packet  radio  systems  over  noisy  communication  channels. 
Roughly  speaking,  because  of  the  incorporation  of  forward 
error  correcting  capability  into  the  feedback  error  control 
protocols,  hybrid  ARQ’s  and  code  combining  give  higher 
average  system  throughput  than  pure  ARQ.  On  the  other 
hand,  since  type-II  hybrid  ARQ  and  code  combining  adapt  to 
changing  channel  conditions,  they  give  lower  average  system 
delay  than  pure  ARQ  and  type-I  hybrid  ARQ.  Type-II  hybrid 
ARQ  has  the  potential  to  serve  as  a  practical  compromise 
between  system  performance  and  complexity.  However,  more 
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detailed  complexity  and  performance  tradeoff  analyses  are 
needed  to  establish  its  usefulness. 

Direct-sequence  spread-spectrum  (DS/SS)  signaling  [7],  [8] 
is  an  effective  physical  layer  modulation  technique  which  al¬ 
lows  multiple  users  to  have  simultaneous  access  to  a  common 
channel  in  a  packet  radio  system.  Due  to  the  presence  of  the 
characteristic  multiple-access  interference  (MAI)  in  the  DS/SS 
signaling  technique,  packet  error  control  protocols  such  as 
those  mentioned  above  have  to  be  applied  in  the  data-link  layer 
to  ensure  error-free  transmission  of  packets  when  a  feedback 
channel  is  available.  Several  researchers  have  proposed  and 
analyzed  the  application  of  some  of  the  feedback  error  control 
protocols  to  direct-sequence  spread-spectrum  multiple-access 
(DS-SSMA)  systems.  The  application  of  pure  ARQ  to  DS- 
SSMA  systems  is  examined  in  [9].  DS-SSMA  systems  with 
type-I  hybrid  ARQ  are  examined  in  [8]  and  [10].  The  use  of 
code  combining  is  investigated  in  [11]  and  [12]. 

The  major  difficulty  in  analyzing  the  feedback  protocols 
described  above  is  that  the  channel  condition,  in  particular,  the 
packet  error  probability,  under  the  DS/SS  signaling  technique 
depends  on  the  number  of  active  transmitting  users  in  the 
system.  Hence,  a  reasonable  analysis  of  such  a  system  has  to 
condition  on  the  number  of  active  users  at  a  particular  moment. 
In  [1 1],  the  number  of  active  transmitting  users  is  assumed  to 
be  fixed.  In  [12],  the  numbers  of  active  users  at  different  time 
slots  are  assumed  to  be  independent  identically  distributed 
Poisson  random  variables.  Since  the  number  of  transmitting 
and  retransmitting  users  at  the  current  moment  depends  on  the 
previous  transmission  events,  the  number  of  active  transmitting 
users  is  actually  a  stochastic  process  with  dependent  time 
samples.  As  a  result,  the  aforementioned  traffic-dependent 
characteristic  of  a  DS-SSMA  channel  is  actually  ignored  in 
[11]  and  [12],  On  the  other  hand,  it  is  shown  in  [9]  and  [10] 
that  system  dynamics  under  the  pure  ARQ  and  type-I  hybrid 
ARQ  protocols  can  be  modeled  by  time-homogeneous  Markov 
chains,  and  hence,  exact  analyses  of  these  two  protocols  can 

be  obtained.  u  u 

In  this  paper,  we  investigate  the  use  of  the  type-II  hybrid 
ARQ  protocol  (ARQ2)  proposed  in  [3]  to  slotted  DS-SSMA 
packet  radio  systems  under  the  finite  user  population  assump¬ 
tion.  This  protocol  acts  as  a  simple  adaptive  coding  scheme 
to  cope  with  the  time-varying  channel  conditions  of  the  DS- 
SSMA  channel.  Details  of  the  DS-SSMA  signaling  technique 
and  ARQ2  are  given  in  Section  II.  By  assuming  the  number  of 
active  transmitting  users  in  the  system  to  be  fixed,  we  present 
several  static  system  performance  measures  such  as  the  packet 
error,  undetectable  error,  and  packet  success  probabilities  in 
Section  III.  The  improved  Gaussian  approximation  technique 
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[7],  which  accounts  for  the  bit-to-bit  error  dependence  within 
a  packet,  is  employed  to  obtain  accurate  results.  In  Section  IV, 
we  show  that  the  system  dynamic  can  be  modeled  by  a  multi¬ 
dimensional  time-homogeneous  Markov  chain.  This  approach 
is  similar  to,  but  considerably  more  complex  than,  those  in  [9] 
and  [10],  where  one-dimensional  Markov  chains  are  employed. 
Compared  to  [1 1]  and  [12],  the  traffic-dependent  characteristic 
of  the  DS-SSMA  channel  is  accurately  modeled  using  this 
approach.  Due  to  the  complexity  of  the  multidimensional 
model,  an  exact  analysis  of  the  system  cannot  be  performed 
easily.  Instead,  we  introduce  an  inferior  scheme  and  a  superior 
scheme,  which  bound  the  performance  of  ARQ2.  Since  both 
the  inferior  and  superior  schemes  can  be  modeled  by  two- 
dimensional  Markov  chains,  system  performances  can  be 
obtained  easily.  In  particular,  we  present  the  steady  state 
throughput  and  delay  performances  of  the  bounding  schemes. 
Moreover,  we  also  show  that  for  each  fixed  input  load,  there 
is  a  unique  optimal  retransmission  probability  in  the  sense 
of  maximizing  the  throughput  and  minimizing  the  delay. 
Similar  optimal  retransmission  probabilities  are  obtained  for 
the  pure  ARQ  and  type-I  hybrid  ARQ  protocols  in  [9]  and 
[10],  respectively.  Finally,  to  demonstrate  the  performance  of 
ARQ2,  we  present  numerical  results  of  the  dynamic  analysis 
of  a  sample  system  in  Section  V. 

n.  System  Model 

In  this  section,  we  describe  the  DS-SSMA  signaling  tech¬ 
nique  and  ARQ2  employed  in  the  physical  layer  and  the 
data-link  layer  of  the  system,  respectively. 

A.  Slotted  DS-SSMA  Packet  Radio  Network 

We  assume  that  DS-SSMA  signaling,  which  provides  the 
basis  for  the  multiple  access  capability  of  the  system,  is  em¬ 
ployed  as  the  digital  modulation  scheme  in  the  physical  layer. 
Without  loss  of  generality,  we  assume  that  the  transmitters 
and  receivers  in  the  DS-SSMA  system  work  in  pairs.  Each 
transmitter-receiver  pair  uses  an  independent  aperiodic  random 
signature  sequence,  with  spreading  factor  N  {N  chips  per  bit), 
to  modulate  data  bits  in  the  BPSK  format.  For  simplicity,  we 
ignore  the  presence  of  thermal  noise  and  assume  that  possible 
bit  errors  are  only  caused  by  MAI  from  other  active  users  in 
the  system. 

We  model  the  system  as  a  finite-user  slotted  packet  radio 
network  with  packet  size  of  L  bits.  There  are  altogether 
U  users  (active  or  not)  in  the  system.  The  slotted  trans¬ 
mission  is  interpreted  in  such  a  way  that  the  transmissions 
of  packets  from  different  users  are  only  approximately  syn¬ 
chronous  within  a  few  chip  durations.^  This  alleviates  the 
difficulty  in  synchronizing  the  transmissions  of  packets  from 
different  users.  We  assume  that  both  the  spreading  factor 
N  and  the  packet  size  L  are  large  as  in  any  practical  DS- 
SSMA  system.  Therefore,  the  system  can  be  viewed  as  an 
asynchronous  one  on  the  chip  level,  while,  on  the  packet 
level,  it  is  a  slotted  system.  Since  such  a  system  satisfies 
the  requirements  described  in  [7],  the  improved  Gaussian 

^  Guard  times  have  to  be  added  between  adjacent  slots  to  prevent  inter-slot 
interference. 
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Fig.  1.  Coding  structure  of  the  DS-SSMA  system  with  the  type-II  hybrid 
ARQ  protocol. 


approximation  technique  [7],  which  accounts  for  the  bit-to- 
bit  error  dependence  within  a  packet,  can  be  employed  to 
compute  the  packet  error  probability  given  the  number  of 
simultaneous  active  users  in  the  system.  We  note  [8]  that 
since  the  improved  Gaussian  approximation  gives  accurate 
error  probability  estimates  regardless  of  whether  there  are  a 
large  number  of  active  users,  our  results  are  reliable  under  all 
load  conditions. 

R.  Type-II  Hybrid  ARQ  Protocol 

We  assume  that  there  is  an  error-free  feedback  channel  from 
the  receiver  back  to  the  transmitter  to  convey  all  necessary 
control  information,  transmission  errors  are  detected  by  the 
receiver,  and  the  transmitter  is  immediately  acknowledged 
after  each  packet  transmission.  We  note  that  this  assumption 
is  valid  for  terrestrial  radio  communication  systems  in  which 
the  transmission  delay  over  the  link  is  much  smaller  dian  the 
packet  duration.  With  the  presence  of  the  feedback  channel, 
ARQ  protocols  can  be  employed  to  ensure  error-free  transmis¬ 
sion  of  packets.  In  this  paper,  we  suggest  the  application  of 
the  following  type-II  hybrid  ARQ  protocol  [3]  in  the  slotted 
DS-SSMA  packet  radio  network. 

Two  block  codes  Co  and  Ci,  where  Co  is  an  (L,  K)  high- 
rate  (K  is  slightly  smaller  than  L)  error-detecting  code  and  Ci 
is  an  (2L,  L)  half-rate  invertible  systematic  code^  with  error- 
correcting  capability  fi,  which  is  designed  for  error  correction 
only,  are  used.  Transmission  errors  in  a  packet  are  detected 
by  calculating  the  syndrome  of  code  Cq.  We  assume  that  an 
idle  user  generates  a  new  if -bit  data  block  with  probability  pi. 
(L-if )  parity  bits  based  on  Go  sre  appended  to  the  if -bit  data 
block  forming  an  L-bit  data  packet.  We  denote  by  J  =  (D,  Q) 
this  L-bit  data  packet,  which  is  a  codeword  in  Co,  where  D  is 
the  if -bit  data  block  and  Q  is  the  corresponding  (L —  if  )-bit 
parity  block.  An  L-bit  parity  check  packet,  P(/),  of  I  based 
on  code  Ci  in  such  a  way  that  (/,  P{I))  is  a  codeword  of  Ci 
is  constructed.  The  coding  structure  is  shown  in  Fig.  1. 

The  data  packet  I  is  first  sent.  The  receiver  calculates  the 
syndrome  of  the  received  packet  based  on  Cq.  If  the  syndrome 
is  zero,  the  transmission  is  assumed  to  be  error-free  and 

^These  codes  are  abundant,  as  it  is  shown  in  [1]  that  we  can  shorten  any 
cyclic  code  with  a  suitable  rate,  for  example,  a  BCH  or  a  Reed-Solomon  code, 
to  obtain  a  half-rate  invertible  code  without  compromising  the  error  correcting 
capability. 
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the  transmitter  is  acknowledged.  Otherwise,  the  received  data 
packet  is  saved  in  a  buffer  and  the  transmitter  is  informed 
to  restart  the  transmission.  Upon  a  transmission  failure,  the 
transmitter  will  send  the  parity  packet  F(/)  with  retransmis¬ 
sion  probability  pi,  (p(,-persistent).  The  receiver,  then,  checks 
the  syndrome  of  P{I)  (see  [3]  for  details).  If  it  is  zero, 
the  transmission  is  assumed  to  be  successful  and  P{I)  is 
inverted  to  obtain  /.  Otherwise,  P{I),  combined  with  the 
previously  stored  /,  is  decoded  based  on  Cj.  If  the  syndrome 
of  the  decoded  packet  /  is  zero,  the  transmission  is  declared 
successful.  Otherwise,  I  will  be  replaced  by  P{I)  in  the 
receiver  buffer  and  the  transmitter  is  informed  to  retransmit 
once  again.  The  data  packet  /  and  the  parity  check  packet 
P{I)  are  sent  alternately  in  the  way  described  above  until  the 
transmission  is  successful. 

We  assume  that  a  backlogged  user,  who  fails  the  transmis¬ 
sion  of  the  current  packet,  is  prevented  from  generating  new 
packets  before  successful  retransmission  of  the  current  back- 
logged  packet.  After  successful  retransmission,  the  backlogged 
user  will  become  an  idle  user  again. 

III.  Static  Analysis 

The  channel  characteristics  of  the  DS-SSMA  system  depend 
on  the  number  of  active  users  in  the  -channel  at  a  particu¬ 
lar  moment.  In  this  section,  we  study  several  static  system 
performance  measures,  namely,  the  packet  error  probability, 
undetectable  error  probability,  and  the  packet  success  prob¬ 
ability,  conditioned  on  the  number  of  active  users  in  the 
channel. 

A.  Packet  Error  Probability 

First,  we  derive  the  packet  error  probability.  Based  on  the 
improved  Gaussian  method  in  [7],  an  accurate  approximation 
of  the  packet  error  probability  accounting  for  different  channel 
conditions  can  be  obtained.  Let  Pe{i  1  k)  be  the  probability  of 
the  event  that  i  errors  occur  in  a  packet  given  that  there  are  k 
active  users  transmitting  in  the  same  time  slot.  Then 


where  Q(  )  =  1  with  $(•)  representing  the  distribution 

function  of  a  standard  normal  random  variable.  We  note  that 
the  random  variable  ^  has  the  density  function 


the  packet  error  event  of  a  user  is  conditionally  independent 
of  error  events  in  other  time  slots  and  those  of  the  other  users 
in  the  same  time  slot. 

B.  Undetectable  Error  Probability 

Next,  we  consider  the  probability  Pf.,ARQ2  that  an  erroneous 
data  packet  is  delivered  by  ARQ2.  We  let  Pn  be  the  probability 
that  an  erroneous  received  packet  is  undetected  by  the  error- 
detecting  code  Cq.  By  the  construction  of  Co  and  Ci,  it  can 
be' shown  [3]  that  Pu  is  the  same  regardless  of  whether  the 
received  packet  is  a  data  packet  or  a  parity  packet.  Suppose 
the  number  of  active  users  is  fixed  to  A;,  by  combining  the 
random  coding  argument  in  [1]  and  the  improved  Gaussian 
approximation  in  [7],  for  a  suitably  chosen  Co,  we  have 

F„|fc  <  -  ^-(0  I  k)]  <  (4) 

We  can  also  show  that 

2Pu\k  ^ 

<  p„|,  +  p,(0lA:)  -  P.(0|(/)  • 

Since  the  bounds  in  (4)  and  (5)  do  not  depend  on  k  and 
Pe(0  I  (7)  >  0,  we  can  choose  Cq  {L  —  K  large  enough)  to 
make  both  of  the  (unconditional)  probabilities  and  Pc.arqz 
arbitrarily  small.  Therefore,  from  now  on,  we  assume  that  Co 
is  powerful  enough  to  detect  all  errors  in  a  packet  and  no 
erroneous  data  packet  will  be  delivered  by  the  ARQ2. 


C.  Packet  Success  Probability 

We  give  a  simple  demonstration  of  the  improvement  due  to 
the  adaptive  coding  capability  of  ARQ2.  Here,  we  consider  the 
1 -persistent  retransmission  policy  and  suppose  that  there  is  a 
mechanism  to  fix  the  number  of  active  users  in  each  time  slot 
to  be  k,  where  1  <  A:  <  C.  We  look  at  one  of  the  active  users 
and  let  P,(f  |  fc)  be  the  probability  of  successful  transmission 
in  the  fth  slot  for  this  user.  Then,  it  is  not  hard  to  see  that 

Ps(0  1  fc)  =  Pe(0  I  fc)  and  for  f  >  1 

Ps{t  I  fc)  =  Pe(0  I  fc)  +  [1  -  Ps{t  -  1  I  fc)]Pc,  (fc)>  (6) 


where 

Pc,(fc)=  X)  Pe{m\k)Pe{n\k)  <1.  d) 

m,n>  1 
m-Fn<ti 

Solving  (6),  we  have,  for  f  >  0 


Ps{t  I  fc)  , 

Pe(0  I  fc-)  -1-  Pc,  (fc)  +  [-Pc,(fc)]*'^Mi  -  I 
1  +  Pc,{fc-) 


where  +  denotes  the  convolution  operator  and 


h\M)  = 


log 


(3) 


We  note  that,  since  the  spreading  sequences  are  aperiodic  and 
independent,  given  the  number  of  active  users  fc  in  a  time  slot. 


Hence,  the  stationary  packet  success  probability 


(9) 


From  (9),  we  see  that  when  P„(()  1  fc)  is  small,  the  performance 
of  the  system  is  determined  by  the  error  correcting  capability 
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of  Cl,  This  can  be  much  better  than  the  pure  ARQ  protocol 
in  which  Ps{k)  =  Pe(0  |  fc). 

To  demonstrate  the  advantage  of  ARQ2  over  pure  ARQ  in 
the  DS-SSMA  system,  we  consider  a  sample  system  using  the 
code  Cu  adopted  from  [3],  with  L  =  500  and  ti  =  54,  The 
spreading  factor  N  is  set  to  31,  and  there  are  50  users  in  the 
system,  i.e.,  U  =  50.  Fig,  2  shows  Pe(0  |  k)  and  Ps{k)  as 
functions  of  k.  We  see  that  ARQ2  provides  a  reasonably  high 
packet  success  probability  for  a  much  larger  range  of  k  than 
pure  ARQ. 


IV.  Dynamic  Analysis 

As  mentioned  in  Section  I,  due  to  the  traffic-dependent 
nature  of  the  DS-SSMA  channel,  the  number  of  active  users  in 
a  time  slot  is  a  stochastic  process  with  dependent  time  samples. 
The  static  analysis,  as  well  as  the  analyses  in  [11]  and  [12], 
only  give  crude  approximations  to  the  real  behaviors  of  the 
systems.  In  this  section,  we  perform  dynamic  analyses  taking 
the  traffic-dependent  characteristic  of  the  DS-SSMA  channel 
into  account. 

A.  Multidimensional  Markov  Model 

Due  to  the  statistical  nature  of  the  error  and  retransmission 
events,  the  number  of  active  users  k  varies  from  slot  to  slot. 
Since  ARQ2  uses  only  the  stored  packet  in  the  receiver  buffer 
and  the  current  received  packet  to  perform  decoding,  the  next 
state  of  the  system  is  independent  of  all  the  previous  activities 
given  the  current  stored  packets  in  all  the  receiver  buffers. 
Moreover,  by  the  construction  of  Cq  and  C\ ,  the  decoder  sees 
the  same  statistical  properties  whether  the  current  received 
packet  is  a  data  packet  or  a  parity  packet.  Hence,  the  dy¬ 
namic  of  the  system  can  be  modeled  by  a  time-homogeneous*^ 

•^For  a  noninvertible  code,  we  cannot  use  a  time-homogeneous  Markov 
chain  to  model  the  system  dynamic.  The  reason  is  that  for  the  case  when 
the  received  parity  check  packet  is  error-free,  we  cannot  invert  it  to  obtain 
the  data  packet.  However,  this  event  can  be  neglected  when  there  are  a  large 
number  of  active  users  transmitting  (the  system  is  operating  near  its  capacity) 
since  the  chance  of  having  an  error-free  received  packet  is  low.  As  a  result, 
for  the  operating  region  of  interest,  the  time-homogeneous  Markov  model  and 
the  analysis  in  this  paper  are  good  approximations  for  a  system  employing  a 
noninvertible  code. 


TABLE  I 

Error-Correction  Capabilities  of  the  Inferior  and  Superior  Schemes 


#  errors  In  received  packet 

7-/2  >t2 


2  A 


x/ 

X 

X 

X 

Inferior  scheme 

v/  correctable 


#  errors  In  received  packet 

l-t2  >t2 


2 

o 


X  uncorrectable 


discrete-time  Markov  process.  More  precisely,  we  let  Ui{t), 
where  I  <  i  <  h  —  I,  denote  the  number  of  users  who  fail 
the  decoding  in  the  {t  —  l)th  slot  with  i  errors  in  each  stored 
packet.  Moreover,  we  use  Uo{t)  to  denote  the  number  of  idle 
users  in  the  ^th  slot  and  Ut^  (t)  to  denote  the  number  of  users 
who  fail  the  decoding  in  the  {t  -  l)th  slot  with  each  stored 
packet  having  at  least  ti  errors.  We  note  that  there  are  ti  +  l 
groups,  and  the  constraints  0  <  Ui  <  U  for  0  <  i  ^  ti  and 

Uo  +  Ui-\ - hC/ti  =  U  must  be  satisfied.  We  define  the  state 

vector  as  X{t)  =  {Uo{t),Ui{t)r  -  ,Ut,^i{t)).  Then,  X{t) 
is  a  -dimensional  time-homogeneous  Markov  chain  which 
gives  a  sufficient  description  of  the  dynamic  of  the  system 
for  the  purposes  in  this  paper.  For  example,  we  can  evaluate 
the  stationary  distribution  of  the  Markov  chain  X{t)  and  then 
obtain  the  steady  state  throughput  and  delay  of  the  system. 
Unfortunately,  the  number  of  states  in  this  model  is  of  the  order 
of  In  practice,  both  ti  and  U  are  large,  and  the  model 
becomes  too  complex  for  practical  numerical  computations. 

B,  Inferior  and  Superior  Schemes 

The  -dimensional  Markov  chain  model  in  the  previous 
section  is  too  complex  for  practical  values  of  ti  and  U,  In 
this  section,  we  introduce  an  inferior  scheme  and  a  superior 
scheme  which  bound  the  performance  of  ARQ2  described  in 
Section  II.  We  show  that  the  system  dynamics  of  each  of 
the  inferior  and  superior  schemes  can  be  modeled  by  a  two- 
dimensional  time-homogeneous  Markov  chain.  As  a  result,  the 
complexity  of  the  analysis  of  them  is  much  smaller  than  that 
required  by  the  multidimensional  model  (order  instead  of 
order  Both  the  inferior  and  superior  schemes  are  based 
on  ARQ2.  They  only  differ  in  terms  of  their  error  correcting 
powers.  Table  I  summarizes  their  error  correcting  capabilities. 
We  recall  that  ARQ2  can  correct  up  to  a  total  of  ti  errors  in  the 
received  packet  and  the  previously  stored  packet.  As  shown 
in  Table  I,  an  error  packet  can  be  successfully  decoded  by  the 
inferior  scheme  only  when  there  are  less  than  t2  =  L^i/2j 
errors  in  each  of  the  received  packet  and  the  previously 
stored  packet.  On  the  other  hand,  the  superior  scheme  can 
give  successful  decoding  unless  there  are  more  than  t2  errors 
in  each  of  the  received  packet  and  the  previously  stored 
packet.  It  is  obvious  that  the  inferior  scheme  has  a  poorer 
error  correcting  capability  than  ARQ2,  while  the  superior 
scheme  has  a  stronger  error  correcting  capability  than  ARQ2. 
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Fig.  3.  State  transition  diagram  for  the  inferior  scheme. 

Therefore,  the  performance  of  ARQ2  is  lower  and  upper 
bounded  by  the  inferior  and  the  superior  schemes,  respectively. 

From  the  discussion  in  Section  IV-A,  the  system  dynamics 
of  each  of  the  two  bounding  schemes  can  be  modeled  by  a 
time-homogeneous  Markov  chain.  To  analyze  the  dynamics  of 
a  user,  it  suffices  to  define  three  states  of  the  user,  namely, 
Idle^  Backlog  If  and  Backlog2,  The  states  W/e,  Backlogly  and 
Backlog!  correspond  to  the  three  events  that  the  previous 
transmission  is  successful,  that  the  previous  transmission  fails 
and  the  store  packet  has  at  most  f  2  errors,  and  that  the  previous 
transmission  fails  and  the  store  packet  has  more  than  t2  errors, 
respectively.  Based  on  Table  I,  it  is  easy  to  verify  that  every 
state  transition  event  of  a  user  is  governed  by  the  following 
set  of  rules. 

1)  In  a  time  slot,  a  user  must  be  in  one  of  the  three  states. 

2)  A  user  in  Idle  transmits  a  new  data  packet  with  prob¬ 
ability  Pi  in  the  current  time  slot.  If  the  transniission 
succeeds,  the  user  will  remain  in  Idle  in  the  next  time 
slot.  Otherwise,  it  will  go  to  either  Backlogl  or  Backlog! 
depending  on  whether  the  received  packet  contains  one 
to  t2  errors,  or  more  than  t2  errors. 

3)  A  user  in  Backlogl  retransmits  a  packet  (data  or  parity) 
with  probability  pb  in  the  current  slot.  If  the  received 
packet  is  error-free  or  contains  at  most  t2  errors,  then 
both  the  inferior  and  superior  schemes  will  give  success¬ 
ful  decoding  and  the  user  will  go  back  to  Idle.  Otherwise, 
the  inferior  scheme  will  give  unsuccessful  decoding  and 
the  user  will  go  to  Backlog!.  On  the  other  hand,  the 
superior  scheme  will  give  successful  decoding  and  the 
user  will  return  to  Idle. 

4)  A  user  in  Backlog!  retransmits  a  packet  (data  or  parity) 
with  probability  pb  in  the  current  slot.  If  the  received 
packet  is  error-free,  i.e.,  the  retransmission  succeeds, 
then  the  user  will  return  to  Idle.  If  the  received  packet 
contains  more  than  t2  errors,  then  both  the  inferior  and 
superior  schemes  will  give  unsuccessful  decoding  and 
the  user  will  stay  in  Backlog!.  Otherwise,  the  inferior 
scheme  will  give  unsuccessful  decoding  and  the  user 
will  go  to  Backlogl  f  while  the  superior  scheme  will  give 
successful  decoding  and  the  user  will  go  back  to  Idle. 

The  respective  state  transition  diagrams  describing  this  set 
of  rules  for  the  inferior  and  superior  schemes  are  shown  in 
Figs.  3  and  4.  In  the  figures,  F°(A:), 
the  probabilities  that  a  received  packet  contains  no  error,  one 
to  t2  errors,  and  more  than  ^2  errors  conditioned  on  fc  active 
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Fig.'4.  State  transition  diagram  for  the  superior  scheme. 

users,  respectively.  It  is  obvious  that 

P^ik)  =  Pe{0  1  k)  (10) 

P}(k)=f^Pe{i\k)  (11) 

1=1 

where 

P°ik)  +  PHk)  +  P!W^^-  (13) 

C.  Two-Dimensional  Markov  Model 

With  the  classification  in  the  previous  section,  the  dynamic 
of  the  whole  system  can  be  modeled  by  a  two-dimensional 
time-homogeneous  Markov  chain.  Let  i{t)  and  j{t)  represent 
the  total  number  of  users  in  Backlogl  and  Backlog2  in  the 
tth  slot,  respectively.  Hence,  the  total  number^  of  users  in 
Idle  is  U  -  i(f)  -  j{t).  The  state  vector  X(f)  =  j'W) 
forms  a  two-dimensional  Markov  chain  with  state  transition 

probabilities,  for  all  t  >  0, 

Pdi^r,li,l)  =  Pr{X(f+  1)  =  {kj)  I  X(t)  =  {hj),0  (14) 

where  ^  €  {/.S'}  indicates  the  inferior  and  superior  schemes, 
respectively.  We  note  that  the  state  indexes  i,  j,  fc,  and  I  satisfy 
the  constraints  that  0  <  i,j,  k,l<U,  i-i-j  <  U,  and  fc+I  <  U. 
The  dependence  of  all  the  variables  on  the  time  t  is  implicit 

in  the  notation  above. 

Suppose  in  the  tth  slot,  n  out  of  the  U  -  i-  j  users  in 
Idle,  i\  out  of  the  i  users  in  Backlogl,  and  ji  out  of  the  j 
userc  in  Backlog2  send  packets  to  the  channel.  Hence,  there  are 
active  users  in  the  tth  slot.  Since  the  (re)transmission 
events  are  independent  of  each  other  and  all  other  events  given 
i  and  j,  the  probability  of  the  above  event  is 

P{n,h,h  I  L.?)  =  PiP  -  i-:hn,pi)D{i,iuV„)BUJi^Pi') 

(15) 

where  )  denotes  the  binomial  distribution,  i.e.,  for 

0  <  m  <  M  and  0  <  p  <  1 
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Then, 

=  ^  I  7i,iuji)P{n,H,ji  I  i,j) 

n=0  i\  —0  j)  =0 

(17) 

where  \  n,ii,j\)  is  the  transition  probability  from 

state  {i,j)  to  state  (A;,/)  conditioned  on  ii,  ?'i,  and  ji.  The 
detailed  derivation  of  P^{i.j:kJ  \  n^ii.ji)  for  both  the 
inferior  and  superior  schemes  is  given  in  the  Appendix. 

We  arrange  the  {U  +  1){U  +  2)/2  states  into  the  row  vector 

[(0, 0),  • .  • ,  (0,  U),  (1,0), . . . ,  (1,  C/  -  1),  •  •  • ,  (f/,  0)].  (18) 


With  the  time  dependent  notations  in  Section  IV-C,  we  have, 
for  t  >  0 

•/:(/.  +  1)  +  :j{t  +  1)  -  i{t)  +  j{t)  +  n{t)  -  .s{t)  (24) 

where  s{t)  is  the  number  of  successfully  (re)transmitted  pack¬ 
ets  in  the  Ath  slot.  Taking  expectations  in  (24)  and  noting  that, 
for  each  ^  E  {PS} 

liin  E[i{t)  +  j{t.)]  = 

t—^OO 

lim  E[n{t)]  = 

t—*oo 

lim  E[s{t)]  = 

t — >oo 

we  see  that  the  average  throughput  is 


We  can  obtain  the  elements  of  the  state  transition  matrix 
according  to  the  ordering  in  (18)  by  (17).  Since  the  Markov 
chain  X(f)  is  irreducible,  a  stationary  distribution  exists. 
According  to  the  ordering  in  (18) 


S^  =  A^^PiU{l^P^).  (25) 

Therefore,  we  can  apply  Little’s  theorem  [13]  to  obtain  the 
steady  state  average  delay  of  the  system 


TT^  =  N(0,0),...,7r^(0,t/),.--,7r^(f/,0)]  (19) 

where  7r^{i,j)  is  the  stationary  probability  of  state  (i,i). 
Moreover,  can  be  found  by  solving  the  following  set  of 
equations: 


TT^  = 

U  U-i 

(20) 

t=0  j=0 

(21) 

D.  System  Throughput,  Delay,  and  Traffic  Intensity 

From  the  stationary  distribution  obtained  in  the  previous 
section,  we  can  find  out  the  average  throughput  of  the  system, 
which  is  defined  as  the  steady  state  average  number  of 
successfully  transmitted  packets  per  slot,  and  the  average  delay 
of  the  system,  which  is  defined  as  the  average  number  of  time 
slots  required  to  successfully  transmit  a  data  packet  when  the 
system  reaches  its  steady  state. 

First,  the  steady  state  average  number  of  backlogged  users 
in  the  system  is 

u  U-i 

=  (22) 

t=0  j=0 

where  ^  5}  represents  the  inferior  and  superior  schemes, 

respectively.  We  define  the  steady  state  backlog  ratio  as 
=  N^/U,  which  measures  the  degree  of  congestion  in  the 
system.  Also,  the  steady  state  average  number  of  new  data 
packets  generated  per  slot  is 

u  u-i 

i=o  i=o 
u  u-i 

i=0  j—0 

=  p;(/(l -/)«).  (23) 


D^^l  + 


—  1  “h 


Pi(l -/?€)' 


(26) 


Also,  we  can  obtain  the  average  traffic  intensity,  which  is  de¬ 
fined  as  the  steady  state  average  number  of  active  transmitting 
users  (packets)  in  a  slot 


+  (27) 


E,  Optimal  Retransmission  Probability 

Our  objective  is  to  maximize  the  system  throughput  and 
minimize  the  system  delay.  Suppose  the  packet  generation 
probability  pi  is  fixed,  by  eliminating  from  (25)  and  (26), 
we  get 


{piD^  -{- 1  -  pi)S^  =  piU,  (28) 

Since  (28)  holds  for  all  values  of  ph  ranging  from  zero  to  one, 
maximizing  the  throughput  implies  minimizing  the  delay 
simultaneously.  Moreover,  from  (25)  and  (26),  we  see  that 
the  maximization  of  and  minimization  of  is  equivalent 
to  the  minimization  of  the  backlog  ratio  It  is  easy  to  see 
from  the  previous  two  sections  that  for  both  the  inferior  and 
superior  schemes,  0^  is  a  bounded  rational  function  of  pfc. 
Hence,  0^  must  attain  its  minimum  value  for  ph  €  [0, 1],  and 
this  minimum  value  can  only  be  attained  at  a  finite  number 
of  different  values  of  ph>  On  the  other  hand,  once  0^  passes 
its  minimum  value,  increasing  the  retransmission  probability 
Pi  will  only  cause  more  congestion,  and  hence,  increase  in 
/3^.  This  simple  physical  observation  indicates  that  0^  attains 
its  minimum  at  a  unique  value  of  pi.  In  summary,  for  every 
fixed  Pi,  there  is  a  unique  optimal  retransmission  probability, 
denoted  by  in  the  sense  of  maximizing  the  system 
throughput  and  minimizing  the  system  delay  simultaneously 
under  both  the  inferior  and  superior  schemes. 

Applying  a  similar  argument  to  the  multidimensional 
Markov  model  of  the  adaptive  coding  scheme  in  Section  IV-A, 
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we  can  also  show  that  for  every  fixed  pi,  there  is  a  unique 
optimal  retransmission  probability,  denoted  by  pj,  for  ARQ2. 
Moreover,  by  simple  physical  arguments  as  above,  Pj  is 
bounded  between  plj  and  pi  s- 

V.  Numerical  Results 

In  this  section,  we  present  numerical  results  of  the  dynamic 
analysis  in  Section  IV  for  the  sample  system  described  in 
Section  m-C.  We  recall  that  the  system  parameters  for  the 
sample  system  are:  t/  =  50,  L  =  500,  fi  =  54,  and  N  =  31. 
In  the  figures  below,  we  use  solid  lines  and  broken  lines 
to  show  anal5dical  results  obtained  from  Section  IV  for  the 
inferior  and  superior  schemes,  respectively.  Moreover,  results 
obtained  from  Monte  Carlo  simulations  for  ARQ2  are  shown 
by  dotted  lines.  Our  main  objective  here  is  to  illustrate  the  tod 
of  information  which  can  be  obtained  by  the  analysis  described 
in  Section  IV.  Following  the  procedures  given  below,  it  is 
trivial  to  investigate  the  effects  of  different  parameters,  such 
as  N,  U,  and  h,  on  the  performance  of  the  system. 

A.  Delay-Throughput  Characteristic 

Fig.  5  shows  the  delay-throughput  characteristic  of  the 
system  for  three  different  values  of  p6,  namely,  pt  =  0-3, 
0.6,  and  0.9.  Each  curve  is  the  delay-throughput  locus  with 
Pi  varying  from  zero  to  one.  In  the  same  figure,  the  delay- 
throughput  characteristic  of  a  system  with  the  pure  ARQ 
protocol  [14]  is  also  shown  for  comparison.  We  see  from  the 
delay-throughput  characteristic  of  the  inferior  scheme  in  Fig.  5 
that  with  a  suitable  choice  of  P6  and  a  slight  increase  in  delay, 
we  can  at  least  double  the  throughput  by  using  ARQ2  instead 
of  pure  ARQ.  For  P6  =  0.6  and  0.9,  for  both  the  superior  and 
inferior  schemes,  the  system  reaches  its  capacity  (the  turning 
points  at  the  delay-throughput  loci)  before  pi  reaches  one. 
Beyond  those  turning  points,  an  increase  in  pi  causes  more 
congestion,  and  hence,  decreases  the  throughput  and  increases 
the  delay.  However,  for  pt  =  0.3,  the  system  never  reaches 
its  capacity.  We  can  see  from  the  figure  that  before  the  system 
reaches  its  capacity,  the  inferior  and  superior  schemes  closely 


bound  the  actual  performance  of  ARQ2.  Beyond  that,  the 
bounds  are  not  as  tight. 


B.  Optimal  Retransmission 

The  respective  optimal  retransmission  probabilities  pi  j 
and  pIs  for  the  inferior  and  superior  schemes  derived  in 
Section  IV-E  are  obtained  and  plotted  as  functions  of  pi  in 
Fig.  6.  We  observe  from  the  figure  that  when  pi  is  smaller 
than  a  critical  value  determined  by  fi  and  U  (0.28  and  0.49  for 
the  inferior  and  superior  schemes,  respectively,  in  the  sample 
system),  the  optimal  pb  is  one.  This  means  that  when  the 
system  is  lightly  loaded,  we  should  always  retransmit  any 
failed  packet  immediately.  Beyond  this  critical  pi,  the  system 
becomes  congested.  In  order  to  maximize  the  throughput  and 
minimize  the  delay,  we  should  reduce  pb-  We  note  that  the 
optimal  Pb  for  the  sample  system  is  always  larger  than  0.4. 
This  agrees  with  the  observation  from  Fig.  5  that  the  system 
never  reaches  its  capacity  for  pb  =  0.3. 

The  optimal  traffic  intensity,  maximum  throughput,  Md 
minimum  delay  corresponding  to  the  optimal  rettansmission 
probability  obtained  above  for  each  of  the  bounding  schemes 
are  plotted  in  Figs.  7,  8,  and  9,  respectively.  From  Fig.  7,  we 
see  that  when  pi  increases  beyond  the  critical  value,  m  order 
to  achieve  the  maximum  throughput,  we  have  to  d^rease  pb  to 
the  extent  that  the  traffic  intensity  is  kept  to  a  specific  constant 
level  determined  by  h  and  U  (about  27  and  34  for  the  inferior 
and  superior  schemes,  respectively,  in  the  sample  system),  y 
cross-checking  with  Fig.  2,  we  observe  that  these  two  values 
lie  in  the  range  such  that  Ps{k)  is  about  0.5.  Roughly  speaking, 
in  order  to  achieve  maximum  throughput,  when  Pi  is  larpr 
than  some  critical  value,  we  have  to  choose  p6  to  maintain  (on 
the  average)  a  certain  constant  number  of  active  transmitting 
users  on  the  channel  so  that  the  packet  success  probability  is 

about  0.5.  , 

From  Fig.  8,  we  see  that  the  maximum  throughput  level 

after  Pi  passes  the  critical  value.  This  indicates  that  the  system 
reaches  its  capacity.  Further  increase  in  pi  will  not  increase 
the  maximum  throughput.  Intuitively,  the  minimum  delay  can 
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Fig.  7.  Optimal  traffic  intensity. 


be  divided  into  two  components,  namely,  the  delay  due  to 
retransmission  because  of  packet  error  and  the  waiting  time 
due  to  the  random  retransmission  policy.  When  pi  is  below  but 
near  the  critical  value,  the  optimal  is  one,  and  the  packet 
success  probability  is  about  0,5  as  discussed  above.  On  the 
average,  we  need  to  transmit  a  packet  twice  and  the  waiting 
time  is  zero  since  =  1.  So,  the  minimum  delay  is  about 
two  slots.  When  pi  passes  the  critical  value,  we  decrease  ph  in 
order  to  maintain  the  packet  success  probability  at  0.5.  We  still 
need  to  transmit  a  packet  twice,  but  the  waiting  time  increases 
since  pt  decreases.  So,  the  minimum  delay  increases  with  pi. 

We  estimate  pi  by  pi  =  {plj  +P6.s)/2  perform  Monte 
Carlo  simulations  to  find  the  traffic  intensity,  throughput,  and 
delay  for  ARQ2  by  using  pi.  The  results  are  also  plotted 
in  Figs,  7-9.  From  the  figures,  we  see  that  these  results  lie 
between  those  of  the  superior  and  inferior  schemes  and  exhibit 
similar  behaviors.  This  indicates  that  pj  is  a  good  estimate  of 
the  real  pj. 

In  a  practical  system,  we  have  to  use  transmitter  buffers  in 
the  transmitters.  From  the  discussion  above,  for  the  sample 
system,  the  transmitter  buffers  should  be  designed  to  give 
Pi  «  0.5.  We  should  select  the  optimal  pl  corresponding  to 
this  Pi.  Then,  we  get  a  throughput  of  14  packets  per  slot  and  a 
delay  of  2.5  slots,  which  are  the  optimal  values  for  the  sample 
system. 

VI.  Conclusion 

In  this  paper,  we  have  applied  the  type-II  hybrid  ARQ 
protocol  in  a  slotted  DS-SSMA  packet  radio  system.  Using 
the  improved  Gaussian  approximation  technique,  we  have 
performed  static  analysis  on  such  a  system  to  obtain  the 
packet  error  and  packet  success  probabilities.  Moreover,  we 
have  employed  two-dimensional  Markov  chains  to  analyze 
the  dynamic  behavior  of  the  system.  Based  on  this  model, 
we  have  upper  and  lower  bounded  the  performance  of  the 
type-II  hybrid  ARQ  protocol  by  considering  the  superior 
scheme  and  the  inferior  scheme,  respectively.  We  have  ob¬ 
tained  the  steady  state  throughput  and  delay  performances  and 
the  optimal  retransmission  probabilities  of  the  two  bounding 
schemes.  Using  the  sample  system  as  an  example,  we  have 
shown  how  to  derive  the  design  criteria  for  the  transmitter 
buffers  in  order  to  achieve  the  system  capacity  with  minimum 
delay. 

Finally,  we  point  out  the  exact  dynamic  analysis  described 
in  Section  IV  is  generally  too  complex  to  perform  a  stability 
analysis  of  the  system.  In  [15],  the  stability  of  the  system 
is  analyzed  based  on  the  equilibrium  point  analysis  (EPA) 
technique,  which  provides  an  approximation  to  the  dynamic 
of  the  system.  Moreover,  extensions  of  the  dynamic  analysis 
to  code  combining  are  not  trivial  because  we  can  no  longer 
model  the  system  dynamic  by  a  time-homogeneous  Markov 
chain.  Approximation  techniques,  such  as  EPA,  are  probably 
needed. 

Appendix 

In  this  appendix,  we  derive  P^{iJ;kJ  \  njiji)  in  (17) 
for  both  the  inferior  and  superior  schemes. 


Fig.  9.  Minimum  system  delay. 
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Suppose  2,  i,  iu  ju  and  n  are  all  given.  From  the  set  of 
rules  described  in  Section  IV-B,  the  state  transitions  of  a  user 
under  the  inferior  and  superior  schemes  are  governed  by  three 
packet  error  events,  namely,  the  received  packet  contains  no 
error,  one  to  t2  errors,  and  more  than  t2  errors.  Since  there  are 
active  users,  the  probabilities  of  these  three  events 
are  Fi>(n  +  ii+  ji),  +  3i)>  and  p2(n  +  h  +  Ji), 

respectively.  Let  Po{ni,n2  |  n,iiiji)  be  the  probability  that 
out  of  those  n  packets  sent  by  users  in  Idle,  ni  of  them  have 
one  to  t2  errors  and  n2  of  them  have  more  than  fa  errors. 
Since  the  error  events  from  different  users  are  independent 
given  n  -f  2i  +  ji, 


Po{ni,n2  1  n.iiJi) 
n\ 


[P^{n  +  ii  +Ji)] 


(n— ni  — 1x2) 


-I  7X2 


(29) 


(n  —  Til  —  Ti2)!Tii!n2! 

•  [Pi(«  +  •  [^e 

where  0  <  ni,  na  <  n,  and  ni+na  <  n.  Let  Pi(fci  | 

be  the  probability  that  out  of  those  ii  packets  retransmitted  by 

the  users  in  Backlogl,  fci  of  them  have  at  most  fa  errors.  Then 

Pi(fci  1  n,ii,ji)  =  P{ii,fci,Pe(n  +  H+ii) 

+P}  (n  +  ii  +  ji))  (30) 


where  0  <  fci  <  ii-  Let  Piih  \  n,  ii,ji)  be  the  probability  that 
out  of  those  ji  packets  retransmitted  by  the  users  in  Backlogl, 
li  of  them  have  at  most  h  errors.  Then 

PaCli  I  =  P(ii,(i,Pe(«  +  *i  +Ji) 

+Pe  (n  +  ii  +ii))  (31) 

where  0  <  li  <  ii-  Note  that  the  three  events  with  probabili¬ 
ties  Po(  ),  Pi(  ),  and  P2(  )  above  are  independent. 

From  the  discussion  above,  ni,  na,  fci,  and  h  completely 
characterize  a  state  transition  of  the  system.  First,  we  consider 
the  superior  scheme.  Given  ni,  na,  h,  and  li  in  a  time  Slot, 
the  system  will  enter  state  {k,l)  in  the  next  slot,  where 

k  =  ni  +  {i  -  ii) 

1  =  na  +  (i  -  ji)  +  {ji  -  h)-  (33) 

Summing  over  all  the  events  leading  into  state  {k,  1),  we  get, 
for 

i-ii<k<n  +  {i  —  ii), 
j  —  ji  <  I  <  n  +  (i  —  ii)  +  3  — 


Ps{iJ\k,l  1  n,ii,ji)  =  5]Po(fc  -  (*  -  h)-«2  I  n.ii,ii) 

7X2 

X  P2(ti2  -I- J  1 1  ri,ii,Ji)  (34) 


where  the  summation  ranges  over 
max(0,  l-j)<  n2  <  min(/  -  {3  -  Ji),  n  +  (i  -  ii)  -  k). 


Otherwise,  Ps(*>i;^i(  I  —  b- 

Similarly,  for  the  inferior  system,  given  ni,  ria,  k\,  and  ti, 
the  system  will  enter  state  (fc,  1)  in  the  next  slot,  where 


Jt  =  ni-l-(t-ti)  +  li  (•’^1 

1  =  712  +  {j  -  ji)  +  (ji  -  (1)  +  (*i  " 


Hence,  we  get,  for 


i  —  ii  <  k  <n  +  {i  —  ii)  +  ji, 
max(7  —  ji,  {i  ~  ii)  +  3  ~  ^)  <l<n  +  i  +  j  —  k, 


Pl{iJ\k;l  I  n,ii,ji) 

=  ^^Po(ni,n2  I  n,ii,ji)P‘zik  -  (i  -  h) 


7Xi  7X2 

-  Til  I  n,n,  ji)  •  A((^i  +  ^2)  +  (i  4-  j) 

-  (A:  +  1)  1  n.iiJi) 


(37) 


where  the  summations  range  over 


max(0,  fc  —  (i  —  ti)  —  ji) 

<  ni  <  min(n,  k  —  (i  —  h),  k  -h  I  —  —  h)  —  j)y 

max(0,  (k  +  1)  —  («  +  j)  -  ^i) 

<  n2  <  min(n,  k  1  -  (i  -  h)  -  j)  - 

Otherwise,  Piii^j^k^l  1  =  0. 
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Buffered  T^pe-II  Hybrid  ARQ  Protocol  for  DS-SSMA 
Packet  Radio  Systems 


Qian  Zhang, *  Tan  F.  Wong,l  and  James  S.  Lehnertl’2 


The  performance  and  stability  of  a  slotted  direct-sequence  spread-spectrum  multiple-access  (DS- 
SSMA)  packet  radio  network  employing  the  type-II  hybrid  automatic-repeat-request  (ARQ) 
protocol  with  finite-length  transmitter  buffers  are  considered.  The  equilibrium  point  analysis  (EPA) 
technique  is  employed  to  analyze  the  system  stabiUty  and  to  approximately  compute  the  system 
throughput,  delay,  and  packet  rejection  probability.  It  is  found  that  the  system  exhibits  bistable 
behavior  in  some  situations.  Issues  of  system  design,  such  as  the  required  length  of  the  transmitter 
buffers  and  the  desirable  region  of  operation  based  on  a  predetermined  performance  requirement 
for  packet  rejection,  are  also  investigated^ _ _ 

KEY  WORDS:  Automatic  repeat  request;  code  division  multiple  access;  equilibrium  point  analysis;  packet 
radio  network;  spread-spectrum  communication. 


1.  INTRODUCTION 

It  is  shown  in  [1, 2]  that  the  type-II  hybrid  automatic- 
repeat-request  (ARQ)  protocol  effectively  increases 
the  system  capacity  of  a  direct-sequence  spread-spectrum 
multiple-access  (DS-SSMA)  packet  radio  network.  The 
complexity  of  the  type-II  hybrid  ARQ  protocol  is  quite 
reasonable.  Apart  from  capacity  and  complexity,  another 
important  issue  affecting  the  performance  of  the  proto¬ 
col  is  system  stability,  which  is  not  considered  in  [1,  2]. 
Moreover,  it  is  assumed  in  [1,  2]  that  a  user  is  prevented 
from  generating  any  new  packets  until  the  transmission  of 
the  current  packet  succeeds.  In  practice,  data  sources  are 
usually  continuous.  Hence,  transmitter  buffers  are  needed 
to  hold  the  incoming  packets  from  the  data  sources.  The 
assumption  of  [1,  2]  is  equivalent  to  assuming  the  trans¬ 
mitter  buffer  of  a  user  can  hold  only  one  packet.  It  is  obvi¬ 
ous  that  this  restriction  is  not  practical  since  using  a  buffer 
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of  length  1  will  reject  too  many  packets  attempting  to  enter 
the  transmitter  buffer.  The  goal  of  this  paper  is  to  investi¬ 
gate  the  performance  and  stability  of  a  DS-SSMA  packet 
radio  network  employing  the  type-II  hybrid  ARQ  protocol 
with  finite-length  transmitter  buffers. 

Although  we  know  [1,  2]  that  the  system  dynamics 
can  be  modeled  exactly  by  a  multidimensional  Markov 
chain,  this  model  is  too  complex  to  allow  us  to  study  the 
system  performance  and  stability.  In  this  paper,  we  em¬ 
ploy  the  equilibrium  point  analysis  (EPA)  technique  [3-5] 
to  approximately  compute  the  system  performance  and 
analyze  the  system  stability.  Approximate  system  through¬ 
put,  delay,  and  packet  rejection  probability  are  obtained 
via  EPA.  The  accuracy  of  the  EPA  approximations  is 
checked  by  comparing  the  results  given  by  EPA  against 
the  results  obtained  from  Monte  Carlo  simulations. 

A  key  result  obtained  in  EPA  of  the  system  is  that 
the  system  stability  can  be  described  in  terms  of  the 
number  of  stable  (defined  in  Section  3.5)  equilibrium 
points.  Like  some  ALOHA  systems  [6,  7],  we  observe 
that  the  system  exhibits  bistable  behavior,  i.e„  there  are 
two  stable  equilibrium  points  in  some  situations.  When 
the  system  is  bistable,  it  oscillates  between  the  two  stable 
equilibrium  points,  one  of  which  corresponds  to  a  sys- 
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tern  state  of  high  throughput  (low  packet  rejection  prob¬ 
ability)  and  the  other  of  which  corresponds  to  a  system 
state  of  low  throughput  (high  packet  rejection  probabil¬ 
ity).  In  this  case,  the  overall  system  performance  is  a 
weighted  average  of  the  performances  at  the  “good”  and 
“bad”  equilibrium  points.  In  designing  a  DS-SSMA  net¬ 
work,  the  oscillation  between  equilibrium  points  should 
be  avoided.  Using  the  results  given  by  the  EPA  tech¬ 
nique,  we  are  able  to  set  up  a  procedure  to  determine 
the  required  length  of  the  transmitter  buffers  so  that  the 
packet  rejection  probability  is  smaller  than  a  predeter¬ 
mined  threshold.  After  choosing  the  transmitter  buffer 
length,  we  can  also  determine  a  desirable  region  of  oper¬ 
ation  for  the  DS-SSMA  network. 

In  Section  2,  we  describe  the  DS-SSMA  network 
employing  the  type-II  hybrid  ARQ  protocol.  In  Section 
3,  we  perform  EPA  on  the  system  by  solving  the  equilib¬ 
rium  system  equations  and  obtain  the  system  throughput, 
delay,  and  packet  rejection  probability.  We  also  provide 
simulation  results  to  check  the  EPA  approximations.  The 
system  stability  is  investigated  by  considering  the  nature 
of  the  equilibrium  points  obtained  by  EPA.  In  Section  4, 
we  develop  a  procedure  to  choose  the  required  length  of 
the  transmitter  buffers  and  establish  a  desirable  region  of 
operation  for  the  DS-SSMA  network.  Finally,  we  con¬ 
clude  this  paper  in  Section  5. 


2.  SYSTEM  DESCRIPTION 

In  this  section,  we  describe  the  DS-SSMA  radio 
network  and  the  type-II  hybrid  ARQ  protocol.  We  model 
the  system  as  a  finite-user  slotted  packet  radio  network 
with  a  packet  size  of  L  bits.  We  assume  that  direct- 
sequence  spread-spectrum  signaling  is  employed  as  the 
digital  modulation  scheme  in  the  physical  layer.  With¬ 
out  loss  of  generality,  we  assume  that  the  transmitters 
and  receivers  in  the  DS-SSMA  system  work  in  pairs. 
There  are  U  transmitter-receiver  pairs  (active  or  not), 
which  are  referred  to  as  users^  in  the  system.  Each  of 
them  uses  an  independent  aperiodic  random  signature 
sequence,  with  spreading  factor  N  {N  chips  per  bit),  to 
modulate  data  bits  in  the  BPSK  format.  For  simplicity, 
we  ignore  the  presence  of  thermal  noise  and  assume  that 
possible  bit  errors  are  only  caused  by  interference  from 
other  active  users  in  the  system.  Moreover,  we  assume 
that  the  received  power  of  each  interfering  signal  is  the 
same  as  that  of  the  desired  user  signal. 

We  assume  that  there  is  an  error-free  feedback  chan¬ 
nel  from  the  receiver  back  to  the  transmitter  to  convey 


all  necessary  control  information,  transmission  errors  are 
detected  by  the  receiver,  and  the  transmitter  is  imme¬ 
diately  acknowledged  after  each  packet  transmission. 
Therefore,  the  application  of  the  following  type-II  hybrid 
ARQ  protocol  [8]  in  the  slotted  DS-SSMA  packet  radio 
network  is  possible. 

Two  block  codes,  Co  and  Cj,  are  used.  The  code 
Co  is  a  {L,K)  high-rate  error-detecting  code,  and  the 
code  Cj,  designed  for  error  correction  only,  is  a  (2L,L) 
half-rate  invertible  systematic  code  with  error-correcting 
capability  t\ .  Any  transmission  error  in  a  packet  will  be 
detected  by  calculating  the  syndrome  of  code  Co-  We 
assume  that  each  data  source  delivers  a  new  /^-bit  data 
block  to  the  corresponding  transmitter  with  probability 
Pi  at  the  beginning  of  each  slot.  There  is  a  packet  buffer 
of  length  M  in  the  transmitter.  If  the  buffer  is  not  full, 
the  data  block  will  be  added  to  the  buffer.  Otherwise,  the 
data  block  will  be  dropped.  Meanwhile,  (L-  K)  parity 
bits  based  on  Co  are  appended  to  the  AT-bit  data  block  at 
the  head  of  the  buffer  forming  an  L-bit  data  packet.  We 
denote  by  /  =  (D,  Q)  this  L-bit  data  packet,  which  is  a 
codeword  in  Co,  where  D  is  the  AT-bit  data  block  and  Q 
is  the  corresponding  (L  -  Ar)-bit  parity  block.  An  L-bit 
parity  check  packet,  P(/),  of  I  based  on  code  Ci  in  such 
a  way  that  (/,P(/))  is  a  codeword  of  Ci  is  constructed. 
The  coding  structure  [8]  is  shown  in  Fig.  1. 

The  data  packet  I  is  first  sent.  If  the  transmission  is 
error-free,  it  is  completed.  Otherwise,  the  received  data 
packet  is  saved  in  a  receiver  buffer,  and  the  transmitter 
is  informed  to  restart  the  transmission.  Upon  a  transmis¬ 
sion  failure,  the  transmitter  will  send  the  parity  packet 
P(/).  If  the  transmission  is  successful,  P(I)  is  inverted  to 
obtain  7.  Otherwise,  P(/),  combined  with  the  previously 
stored  /,  is  decoded  based  on  C\ .  If  the  decoding  is  suc- 


Codeword  C/ 

_ A _ 


Codeword  Co 

_ A _ 


^bits 


L- orbits 


L  bits 


i  D 
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Spreading  Sequence 


N  chips 

Fig.  1.  Coding  structure  of  the  DS-SSMA  system  with  the  type-II 
hybrid  ARQ  protocol. 
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cessful  (i.e.,  the  total  number  of  errors  in  I  and  P(I)  is  at 
most  /|)»  the  transmission  is  completed.  Otherwise,  I  is 
replaced  by  P{I)  in  the  receiver  buffer,  and  the  trans¬ 
mitter  is  informed  to  retransmit  once  again.  The  data 
packet  /  and  the  parity  check  packet  P(I)  are  sent  alter¬ 
nately  in  the  way  described  above  until  the  transmission 
is  successful.  After  a  successful  transmission  has  been 
acknowledged  at  the  end  of  a  time  slot,  the  data  block 
at  the  head  of  the  transmitter  buffer  is  removed. 

It  is  shown  in  [2]  that  the  performance  of  the  DS- 
SSMA  network  is  mainly  determined  by  its  traffic  inten¬ 
sity  since  the  DS-SSMA  signaling  technique  is  interfer¬ 
ence  limited.  We  need  to  maintain  a  certain  level  of  traf¬ 
fic  intensity  in  the  network  in  order  to  achieve  its  maxi¬ 
mum  capacity.  We  employ,  as  the  simplest  way  to  con¬ 
trol  the  network  traffic,  the  following  scheme  to  direct 
the  (re)transmissions  of  data  and  parity  packets  in  the 
type-n  hybrid  ARQ  protocol  described  above.  We  per¬ 
form  a  simple  form  of  channel  load  sensing  by  using 
the  length  of  the  transmitter  buffer  as  an  indicator.  If 
a  newly  arrived  data  block  enters  an  empty  buffer,  it 
will  be  sent  instantly  (neglecting  the  processing  time 
required  to  calculate  the  parity  bits).  Other  than  that,  ' 
all  (re)transmissions  take  place  in  a  slot  with  probability 
Pb.  Intuitively,  this  transmission  scheme  can  be  under¬ 
stood  as  follows.  When  the  transmitter  buffers  are  empty, 
the  network  traffic  is  probably  low,  i.e.,  there  is  a  high 
chance  that  the  transmission  will  be  successful.  Hence, 
the  newly  arrived  packet  should  be  sent  immediately  to 
reduce  the  delay.  On  the  other  hand,  a  failed  transmis¬ 
sion  and  a  filled  transmitter  buffer  both  indicate  that  the 
network  may  be  congested.  Hence,  it  is  reasonable  to  set 
the  (re)transmission  probability  to  pb  to  limit  the  traffic 
intensity. 


3.  EQUILIBRIUM  POINT  ANALYSIS 

First,  we  assume  that  the  system  changes  states  at 
each  slot  boundary  and  observation  is  made  at  the  begin¬ 
ning  of  each  slot,  just  before  any  data  block  arrives.  We 
say  that  a  user  is  in  state  S,-,  for  i  =  1, . . . ,  M,  when  the 
transmission  succeeds  in  the  previous  slot  and  there  are 
i  data  blocks  in  the  transmitter  buffer.  A  user  is  in  state 

Eij,  for  i  =  1 . M  and  y  =  1 . fi  -  1,  when  the 

transmission  fails  in  the  previous  slot,  the  stored  packet 
in  the  receiver  buffer  has  j  errors,  and  the  transmitter 
buffer  contains  i  data  blocks.  Similarly,  a  user  is  in  state 
EiJ^  when  the  transmission  fails  in  the  previous  slot,  the 
stored  packet  in  the  receiver  buffer  has  at  least  ti  errors. 


and  the  transmitter  buffer  contains  i  data  blocks.  More¬ 
over,  a  user  is  in  state  Idle  when  its  transmitter  buffer  is 
empty.  Fig.  2  shows  all  the  possible  states  for  a  user  in 
the  system. 

We  let  Rij  be  the  number  of  users  in  for  1  < 

I  <  M  and  1  <j<t\,  R,  be  the  number  of  users  in  5,, 
for  1  <  I  <  M,  and  Ri  be  the  number  of  users  in  Idle.  We 
note  that  the  following  constraint  is  satisfied: 

M  /  h  \ 

/=!  V  / 

where  0  <  /?/,  Rij  <  C/,  for  1  <  /  <  M  and  1  < ;  <  /i, 
and  0  <  /?/  <  f/.  Then  the  vector  X(0  =  [Ri,  R\, 
/?!  1,...,  Rm^  Rm,u  -^  t  >  ly 

forms  a  time-homogeneous  Markov  chain  that  models 
the  dynamics  of  the  system.  Unfortunately,  this 
dimensional  Markov  chain  is  too  complex  to  allow  exact 
analysis.  As  an  alternative,  wc  employ  the  EPA  tech¬ 
nique  [3-5]  to  approximately  compute  the  system  per-^ 
formance  and  analyze  the  system  stability. 

In  EPA,  it  is  assumed  that  the  system  is  at  equilib¬ 
rium,  i.e.,  the  expected  number  of  users  leaving  a  given 
state  is  exactly  equal  to  the  expected  number  of  users 
entering  the  same  state.  We  assume  that  the  system  oper¬ 
ates  about  the  equilibrium  point  instead  of  in  a  man¬ 
ner  that  is  randomly  distributed  according  to  the  station¬ 
ary  distribution  of  the  Markov  chain.  Then  we  approx¬ 
imate  the  steady-state  (average)  system  performance  by 
the  performance  at  the  equilibrium  point.  We  note  that 
it  is  possible  for  a  system  to  have  multiple  equilibrium 
points.  In  this  case,  the  system  stays  around  one  equilib¬ 
rium  point  for  a  random  amount  of  time,  then  moves  and 
stays  in  the  vicinity  of  another  equilibrium  point,  and  so 
on.  It  is  this  kind  of  oscillating  behavior  that  allows  us 
to  describe  the  stability  of  the  system. 
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1  5*7]  Rm.i 

Bm,2 
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Fig.  2,  States  of  a  user  in  the  DS-SSMA  system. 
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3.1.  Equilibrium-State  Transitions  of  a  User 

We  let  r/,  r\,  rij,...,  be  the  values  of  /?/, 
/?i,  /?ij, . . . ,  Rmjx  at  an  equilibrium  point,  respectively. 
We  note  that  all  of  the  values  can  be  nonnegative  real 
numbers,  but 

M  /  t\  \ 

+  (2) 

,=1  \  j=\  / 

We  approximate  the  system  dynamics  about  the  equilib¬ 
rium  point  by  assuming  that  each  user  resides  in  Idle 
independently  with  probability  ri/U.  Then  the  total  num¬ 
ber  of  users  in  Idle  is  binomial  distributed  with  mean 
r/.  Next,  we  focus  on  one  of  the  users  in  the  system. 
The  behavior  of  this  user  can  be  modeled  by  the  Markov 
chain  with  states  Idle,  Em,  ti  •  To  obtain  the 

state  transition  probabilities  of  the  Markov  chain,  we 
need  to  know,  for  ^  =  0, . . . ,  U  -  1,  the  probability,  Pk^ 
that  there  are  k  interferers  transmitting  in  the  same  slot. 
Given  that  a  user  is  transmitting  and  there  are  n  other 
users  in  Idle,  each  of  these  n  users  transmits  a  packet  in 
the  same  slot  with  the  same  probability  p/.  At  the  same 
time,  there  are  t/- 1  -  n  users  (in  other  states)  who  trans¬ 
mit  packets  with  probability  By  the  law  of  total  prob¬ 
ability,  we  see  that,  for  /:  =  0, . . . ,  U  -1. 

U  -  1  min(n,  k) 

'’•'X  X 

n=0  /=inax(0,it  +  /z  -  t/+ 1) 

•  Bin, l,Pi)BiU -\-n,k-  l,pb)BiU -  1, n, r,/U) 

(3) 

where  B(-,  •)  denotes  the  binomial  distribution,  i.e.,  for 
0  /  and  0  ^  1, 


B{j,j,p)  P''(^  -  py  ■'  (4) 

Next,  we  let  P(y ),  for;  =  0, . . . ,  fj  -  1,  be  the  prob¬ 
ability  that  there  are  j  errors  in  the  received  packet  given 
that  the  user  does  transmit  a  packet  in  the  current  slot. 
Moreover,  we  use  P(t\)  to  denote  the  probability  that 
there  are  at  least  t]  errors  in  the  received  packet  under 
the  same  condition.  It  is  easy  to  see  that,  for  y  =  0, . . . , 

u-\ 

PU)='^  PeU\k+l)Pk  (5) 

ik  =  0 

where  PeU\^)  is  the  conditional  probability  of  the  event 
that  there  are  j  errors  (when  j  =  t\,  it  means  at  least 
t{  errors)  in  the  received  packet  given  that  there  are 
k  simultaneously  active  users  in  the  system.  We  note 
that  PeU\^)  can  be  obtained  by  the  improved  Gaussian 
approximation  technique  [9,  10],  which  accounts  for  the 
bit-to-bit  error  dependence  within  a  packet.  The  detailed 
derivation  of  PeU\^)  can  be  found  in  [9]. 

In  this  section,  we  consider  the  case  where  the 
buffer  length  Af  >  2.  The  case  for  M  =  1  will  be  consid¬ 
ered  separately  in  Section  3.3.  We  divide  the  M(t\  +  1) 
+  1  states  into  seven  groups,^  namely,  {Idle},  {S\}, 
{Ei,i  ...,  E^tih  {^^,1  •••»  {Sih  and 

Ei^tih  for  2  <  /  <  M  ~  1.  States  in  dif¬ 
ferent  groups  behave  differently,  and  states  in  the  same 
group  behave  similarly.  With  this  classification,  it  is  easy 
to  obtain  the  state  transition  probabilities.  Fig.  3  shows 
the  state  transition  diagrams  of  a  user  with  state  tran¬ 
sitions  involving  a  sample  state  in  each  of  the  seven 

*When  A/  =  2,  we  have  only  five  groups:  {Idle},  {5i},  {£i, i  ...» 
Ri,ti }»  {S2)y  and  {£2,1  }• 


(a) 


Fig.  3.  State  transition  diagrams  of  a  user  when  M  >  2:  (a)  Idle;  (b)  Si  and  Eij,  for 
i<j<ti;  (c)  Sm  and  Emj  for  1  <y  <  /i;  (d)  Si,  for  2  <  /  <  Af  -  l(Af  >  3);  (e)  Eij,  for 
2</<Af-  1  and  1  <y  < /i(Af  >  3). 
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groups.  In  Fig.  3,  most  of  the  transitions  are  self-explana¬ 
tory.  However,  special  attention  should  be  given  to  those 
transitions  involving  Eij  in  Figs.  3b,  3c,  and  3e.  For  clar¬ 
ity,  only  the  cases  for  ri  -y+1  >j  are  shown  in  these  three 
figures.  For  t\ -j+ 1  <  y,  the  transition  probability  of  each 
of  the  vertical  transitions  and  self-loops  involving  Eij  is 
the  sum  of  two  terms  shown  in  the  figures:  one  from  the 
state  indexed  by  the  subscript  j  and  the  other  from  the 
state  indexed  by  the  subscript  t\  -j'+  1,  where  t\  -  j' 
+  1  =  y.  Moreover,  we  use  the  notation  x\\y  in  Fig.  3b 
to  represent  the  different  transition  probabilities  x  and 
y  for  the  cases  where  M  >  3  and  Af  =  2,  respectively. 
The  same  notation  is  also  used  in  Fig.  3d  to  represent 
the  different  transition  probabilities  for  the  cases  where 
i  9^  M  -  I  and  /  -  Af  -  1,  respectively. 


3.2.  Equilibrium  Equations 

Again,  we  consider  the  case  where  Af  >  2  in  this 
section.  At  equilibrium,  the  expected  inflow  to  each  of 
the  system  states.  Idle,  S\,  is  identical 

to  the  expected  outflow  from  it.  Using  vector  notation, 
we  let 


T  —  [r/,rj,ri,...,ri,  F/,  r^]  (6) 

where,  for  1  <  /  <  Af , 

r/ =  (7) 

Furthermore,  we  define  three  row  vectors  with  t\  ele¬ 
ments: 


[/,-i  n-2  1 

X  2  . ^(0)  (8) 

J=0  j=0  J 

P(y)  =  [0,...,0,PO),...,i>(;)]  (9) 

y  terms 

h(y)=[0,...,0,  ^  0,...,0]  (10) 

y-th  term 


From  Fig.  3a,  it  is  easy  to  see  that  the  expected  inflow 
to  and  the  expected  outflow  from  Idle  are,  respectively, 
given  by  ra/  and  r/,  where 


(11) 


From  Fig.  3b,  the  expected  inflow  to  and  the  expected 
outflow  from  are,  respectively,  given  by  rai  and  rj, 
where 


ai  =[0,(1  -pi){l  -  pb)  +  PiPhP(0), 

PiPbP,  {(1  - Pi)Pb\\Pb)P{0), 

-  Pi)Pb\\Pb}P,0,...,0f  (12) 

From  Fig.  3b,  the  expected  inflow  to  and  the  expected 
outflow  from  E\j,  for  1  <y  <  ^i,  are,  respectively,  given 
by  raij  and  r\j,  where 

ai,y  =  [piPU)Al  -Pi)PbP(j),(l  -pi)PbP{j) 

+  (1-P,)(l-P*)h(y),0 . Of  (13) 

From  Fig.  3c,  the  expected  inflow  to  and  the  expected 
outflow  from  Sm  are,  respectively,  given  by  raA/  and 
where 


P>  *  ^  jP/(1  Ph\  0, « ^  ,  0^ 

(Af  -  2)(f  1  +  1)  +  1  terms  t\  terms 

l-p*,0^^f 

t\  terms 


(14) 


From  Fig.  3c,  the  expected  inflow  to  and  the  expected 
outflow  from  Emj,  for  1  <y  <  fi,  are,  respectively,  given 
by  TSiMj  ''Af,y  »  where 


auj  =[  p,.^.,0,  PiPbP(J),PiPbP(j) 

(A/-2)((i  +  l)  +  lterms 

+ a(  1  -  PbMj ),  PbPU ), 

PbPU)  +  (i-PbMj)f  (15) 


From  Fig.  3d,  when  Af  >  3,  the  expected  inflow  to  and 
the  expected  outflow  from  5/,  for  2  <  /  <  Af  -  1,  are, 
respectively,  given  by  ra/  and  r/,  where 

a/  =[  p,..^.,0,  Pi(l  - Pb)y p> 

(/  -  2)(r|  +  1 )  +  1  terms  0  terms 

(1  -Pi)(l-Pb)  +PiPbP(0),PiPbP, 
{{^-Pi)Pb\\Pb}pm[0- -PdPbWPbW, 

0 . of  (16) 

From  Fig.  3e,  when  A/  >  3,  the  expected  inflow  to  and 
the  expected  outflow  from  Eij,  for  2  <  i  <  Af  -  1  and 


a/  =[1  -  A(1  -  Pm,(l  -Pi)PbP(0), 
(1  -pi)pbV,0,...,0f 
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I  <  j  <  ti,  are,  respectively,  given  by  ra,.;  and  r,,y, 
where 

a,,y  =[  0, . . . ,  0,  PiPbPU)>PiPb^U ) 

(/ -2)(ri  +  1)+  1  terms 

+  P/(l  -  Pb)Hj),  (1  -  Pi)PbPU\ 

(1  -  Pi)PbP{j)  +  (1  -  Pi)(t  "  Pb)^ii)y 
0,...,0f  (17) 


[r,,r][B,q]  =  b  (19) 


where  b  =  [  0, . . . ,  0, 


U]  and  B  is  obtained  by  de- 


1  +  1 )  -  1  terms 


leting  the  first  and  the  last  columns  of  the  matrix  A  -  I. 
Now,  we  partition  the  M(ri  +  1)  +  l-by-A/(t|  +  1)  matrix 
[B,  q]  into  the  form; 


[B,q] 


c 

C 


(20) 


By  rewriting  Eq.  (6)  and  equating  inflows  to  out¬ 
flows,  we  have  the  following  set  of  equilibrium  equa¬ 
tions  in  matrix  form: 

r[A,q]  =  [r,  17]  (18) 


where  c  is  a  row  vector  of  M(f\  +  1)  elements  and  C  is 
a  square  matrix  of  size  M{ti  +  !)•  Then  the  solution  of 
Eq.  (19)  is 

f  =  (b  -  r;c)C"'  (21) 


where  A  =  [a/,  ai,  ai.y, . . . ,  a,-,  a,-,y, . . . ,  aw,  Am.}\  and  q 
is  an  all-ones  column  vector  with  A/(ri+l)  +  1  elements.^ 
If  all  the  elements  of  a  solution  vector,  r,  of  the  equilib¬ 
rium  equations  in  (18)  lie  in  [0, 17],  then  r  defines  an 
equilibrium  point  about  which  the  system  is  assumed  to 
operate. 

First,  the  existence  of  at  least  one  equilibrium  point 
is  guaranteed  by  the  following  proposition,  whose  proof 
can  be  found  in  the  Appendix: 

Proposition  1:  The  set  of  equilibrium  equations  in  (18) 
has  an  odd  number  of  solutions,  and  all  elements  of 
every  solution  lie  in  (0,  U). 

Next,  we  proceed  to  find  all  the  equilibrium  points. 
We  note  that  the  first  M{t\  +  1)  +  1  equations  in  (18) 
are  linearly  dependent.  Hence,  we  can  remove,  say,  the 
(M(ri  +  1)  +  l)-th  equation  without  affecting  the  solu¬ 
tions  of  the  original  set  of  equations.  Since  all  elements 
of  A  are  functions  of  n,  it  is  simpler  to  solve  (18)  by 
first  expressing  r  =  [ri,  ri,  . . . ,  r*/,  r*/]  in  terms  of  r/. 
This  can  be  done  by  further  deleting  the  first  equation  in 
(18)  and  solving  the  resulting  system  of  linear  equations 
in  terms  of  r/.  Then  we  obtain  the  solutions  of  (18)  by 
substituting  ri,  ri, . . . ,  rw,  into  the  first  equation  to 
solve  for  r/. 

More  precisely,  consider  the  following  set  of  linear 
equations  obtained  by  removing  the  first  and  the  + 
1)  +  l)-th  equations  in  (18): 

^In  the  rest  of  this  paper,  we  will  use  the  same  notation  q  to  denote 
the  all-ones  column  vectors  of  different  sizes. 


We  note  that  we  have  assumed  the  fact  that  C  is  invert¬ 
ible  in  Eq.  (21).  This- is  guaranteed  by  the  following 
proposition,  whose  proof  is  given  in  the  Appendix: 

Proposition  2:  The  square  matrix  C  is  invertible  for 
each  r/  in  [0,  U]. 

Next,  we  define 

Fi  =  ri-  ra/  (22) 

Physically,  F/  represents  the  expected  net  outflow  from 
Idle  at  equilibrium.  We  note  from  Eqs.  (21)  and  (22)  that 
F/  is  a  function  of  r/  only.  To  indicate  this  dependence, 
we  use  the  notation  F/(r/).  Finally,  we  make  use  of  the 
first  equation  in  (18),  which  is  equivalent  to 

F/(r/)  =  0  (23) 

So,  if  Eq.  (23)  has  a  solution  in  [0,  U],  then  the  whole  set 
of  equilibrium  equations  has  a  legitimate  solution  when 
all  the  ri,  ri, . . . ,  r*/,  obtained  by  Eq.  (21)  lie  in  the 
same  interval.  By  Proposition  1,  this  is  guaranteed. 

As  a  summary,  we  can  find  the  equilibrium  point(s) 
of  the  system  by  the  following  algorithm: 

1.  Construct  A  as  a  function  of  r/  in  [0,  U]  using 
Eqs.  (11H17). 

2.  Find  the  zero(s)  of  F/(r/)  in  [0,  U]  using  Eqs. 
(21)  and  (22). 

3.  Obtain  ri,  fi,...,  r^,  tu  by  Eq.  (21)  for  each 
zero. 
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Fig.  4.  F/  as  a  function  of  r/. 

4.  Each  solution  {r/,  ri,  ri,...,  Ta/}  gives  an 
equilibrium  point. 

For  example,^  we  consider  the  sample  coding  structure 
[2]  when  iV  =  31,  L  =  500,  and  ti  =  54.  In  Fig.  4,  we  plot 

^For  this  and  all  the  following  numerical  examples,  we  employ  the 
same  sample  coding  structure. 


Fj  as  a  function  of  r/  for  different  values  of  pb  by  fixing 
U  =  60,  M  =  6y  and  /?,-  =  0.2.  We  observe  that  F/  has 
a  single  zero  for  pb  =  0.2  and  0.5.  When  pb  increases 
to  0.9,  F/  has  three  zeros.  We  note  that  each  of  these 
zeros  corresponds  to  an  equilibrium  point  of  the  system. 
Hence,  we  conclude  that  for  some  values  of  pi  and  pb, 
the  system  can  have  multiple  equilibrium  points. 


3.3.  M  -  1  Case 

When  M  -  I,  since  the  buffer  can  hold  only  one 
packet,  any  newly  arrived  packet  will  only  see  either 
an  empty  buffer  or  a  full  buffer  filled  by  the  packet 
that  fails  in  the  previous  transmission.  Hence,  a  user 
will  never  enter  the  state  iS*],  and  it  makes  no  differ¬ 
ence  if  we  remove  Si  from  the  Markov  chain  model  of 
a  user.  Figs.  5a  and  5b  show  the  resulting  state  tran¬ 
sition  diagrams  of  a  user  with  state  transitions  involv¬ 
ing  Idle  and  . . . ,  E\^ti }»  respectively.  By  remov¬ 
ing  5i,  the  resulting  Markov  chain  becomes  irreducible. 
Thus,  Propositions  1  and  2  still  hold  with  appropri¬ 
ate  modifications,  and  the  procedure  for  finding  the 
equilibrium  points  of  the  system  is  the  same  as  be¬ 
fore. 


PiP(ti) 


Fig.  5.  State  transition  diagrams  of  a  user  when  M  -  1:  (a)  Idle;  (b)  E\j,  for 


Buffered  Type-II  Hybrid  ARQ  Protocol 


211 


3.4.  Equilibrium  Throughput,  Delay,  and  Packet 
Rejection  Probability 

After  an  equilibrium  point  of  the  system  has  been 
obtained  by  the  method  described  in  the  previous  sec¬ 
tion,  we  can  calculate  the  system  throughput,  delay,  and 
packet  rejection  probability  at  this  equilibrium  point.  The 
system  throughput,  5,  is  defined  as  the  average  num¬ 
ber  of  successfully  transmitted  packets  per  slot.  Since 
a  user  will  go  to  Idle  or  S;  in  the  next  slot  when  a 
(re)transmission  of  the  user  occurs  and  succeeds,  it  is 
easy  to  see  from  Fig.  3  that 

S  =  p,P(0)r/ 


after  it  is  admitted  to  the  transmitter  buffer.  Because  of 
Eq.  (26),  we  can  apply  Little’s  theorem  [11]  to  obtain 
the  system  delay: 

W  W 

D=l  +  -^  =  1  +  -^  (28) 

A  6 

The  packet  rejection  probability,  Prej,  is  defined  as 
the  probability  of  the  event  that  a  packet  generated  by 
a  data  source  is  rejected  from  entering  the  transmitter 
buffer  of  the  corresponding  user.  As  mentioned  above,  an 
arriving  packet  is  rejected  from  entering  the  transmitter 
buffer  of  a  user  only  when  the  user  is  in  one  of  the  states 
Sut  £■*/,!»  •  •  •  •  ^M,t\  ■  Hence, 


+ 


Z 

1  =  1 


PiP(0)r.-  +  £ 


y=i 


'1  -j 

Pb  X 

k  =  0 


(24) 


We  use  A  to  denote  the  average  number  of  packets 
accepted  by  the  system  (the  transmitter  buffers)  per  slot. 
Since  a  user  in  any  states  except  Smj 
can  accept  an  arriving  packet  to  its  transmitter  buffer, 
we  have 


A  =  pi 

k+X  '■'  ’4 

1=1 

\  y=l 

By  adding  together  the  equilibrium  equations  in  (18)  of 
the  states  with  the  same  number  of  packets  in  the  trans¬ 
mitter  buffer  and  substituting  the  resulting  M  equations 
recursively,  we  can  show  that 


5  =  A  (26) 

Physically,  Eq,  (26)  means  that  the  average  number  of 
packets  accepted  to  the  system  is  equal  to  the  system 
throughput.  This  relationship  is  intuitive  since  the  system 
does  not  act  as  a  sink  to  packets. 

At  equilibrium,  the  average  number  of  packets  wait¬ 
ing  in  the  system,  W,  is 


The  system  delay,  D,  is  defined  as  the  average  number  of 
time  slots  required  to  successfully  transmit  a  data  packet 


Combining  Eqs.  (2),  (25),  (26),  and  (29),  we" obtain  the 
following  equation  relating  S  and  Pref 

^+PiPreJ=Pi  (30) 

To  verify  the  EPA  technique,  we  perform  Monte 
Carlo  simulations  to  obtain  estimates  of  the  system 
throughput,  delay,  and  packet  rejection  probability  and 
use  these  estimates  to  check  the  validity  of  the  EPA 
results.  Figs.  6a-c  show  the  system  throughput,  delay, 
and  packet  rejection  probability  as  functions  of  pb, 
respectively,  for  the  system  with  U  =  60  and  /»,•  =  0.2. 
Depending  on  the  value  of  M,  the  system  has  a  single 
equilibrium  point  when  pb  is  small.  However,  the  system 
has  three  equilibrium  points  when  pb  is  large  (see  Fig.  4). 
When  there  are  three  equilibrium  points,  we  show,  in  Fig. 
6,  the  EPA  results  for  the  two  stable  equilibrium  points 
(defined  in  Section  3.5).  We  see  that  the  stimulation  and 
EPA  results  match  very  well  when  there  is  a  single  equi¬ 
librium  point.  When  there  are  three  equilibrium  points, 
the  simulation  results  fall  between  the  EPA  results  of  the 
stable  equilibrium  points.  An  interesting  observation  is 
that  except  for  the  case  of  M  =  1,  the  simulation  results 
are  closer  to  the  equilibrium  point  with  low  throughput 
(high  packet  rejection  probability)  when  pb  is  large.  As 
expected,  a  longer  transmitter  buffer  helps  to  reduce  Prej- 
Next,  we  consider  the  case  with  t/  =  60  axidpt  -  0.4. 
Figures  7a-c  show  the  corresponding  throughput,  delay, 
and  packet  rejection  probability  curves,  respectively.  We 
note  that  the  system  has  only  one  equilibrium  point  when 
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Fig.  6.  Equilibrium  throughput,  delay,  and  packet  rejection  probability  Fig.  7.  Equilibrium  throughput,  delay,  and  packet  rejection  probability 

{pi  =  0.2):  (a)  throughput;  (b)  delay;  (c)  packet  rejection  probability.  {pi  -  0.4):  (a)  throughput;  (b)  delay;  (c)  packet  rejection  probability. 
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M  -  6  and  10.  From  the  figures,  we  see  that  the  simula¬ 
tion  results  also  match  the  EPA  results  well  in  this  case. 
Although  the  maximum  throughput  achieved  in  this  case 
is  larger  than  that  in  the  case  where  p-,  =  0.2,  the  packet 
rejection  probability  is  very  high  even  when  M  is  large. 
This  indicates  that  the  system  cannot  handle  an  input  rate 
of  Pi  =  0.4.  Moreover,  this  result  also  indicates  that  the 
maximum  system  throughput  alone  may  not  be  a  good 
measure  of  the  system  performance.  More  detailed  dis¬ 
cussion  on  the  capacity  of  the  system  will  be  given  in 
Section  4. 

We  conclude  from  Figs.  6  and  7  that  the  EPA  tech¬ 
nique  can  provide  accurate  approximations  of  the  sys¬ 
tem  throughput,  delay,  and  packet  rejection  probability 
when  there  is  a  single  equilibrium  point,  i.e.,  when  we 
can  assume  that  the  system  operates  in  the  vicinity  of 
the  equilibrium  point.  When  there  are  multiple  equilib¬ 
rium  points,  the  system  behavior  is  more  complex  (see 
Section  3.5). 

3  System  Stability 

It  is  demonstrated  in  the  previous  section  that  the 
system  can  have  multiple  equilibrium  points.  Roughly 
speaking,  the  system  oscillates  among  some  equilibrium 
points  in  this  case.  The  existence  of  these  oscillations 
determines  the  stability  behavior  of  the  system. 

First,  we  classify  the  equilibrium  points  according 
to  stability: 

Definition  1:  An  equilibrium  point  {r/,  ri,  ri,i,..., 

}  is  stable  if  Fj(r/),  the  derivative  of  F/  at  r/.  is 
positive.  Otherwise,  it  is  unstable. 


unstable  and  the  boundary  ones  are  stable.  The  physical 
interpretation  above  indicates  that  a  system  with  a  single 
stable  equilibrium  point  (e.g.,  U  =  60,  M  =  6,  pi  -  0.2, 
pt,  -  0.5)  stays  in  the  vicinity  of  the  equilibrium  point. 
In  this  case,  we  say  the  system  is  stable.  For  the  system 
with  two  stable  equilibrium  points  (e.g.,  U  =  60,  M  = 
6,  Pi  -  0.2,  pb  -  0.9),  oscillation  occurs  between  those 
points  with  stays  of  random  durations.  In  this  case,  we 
say  the  system  is  bistable.  Figs.  8a  and  8b  show  the  time 
trajectories  of  /■/  obtained  in  a  simulation  run  for  the  two 
sample  systems.  The  results  in  the  figures  verify  the  sys¬ 
tem  dynamics  predicted  by  the  EPA  technique. 

For  the  bistable  system  shown  in  Fig.  8b,  one  of  the 
two  stable  equilibrium  points  (“good”)  corresponds  to  a 
high  system  throughput  and  a  low  packet  rejection  prob¬ 
ability,  whereas  the  other  one  (“bad”)  corresponds  to  a 
low  system  throughput  and  a  high  packet  rejection  prob- 


(a)' 


Styihafrr 


Physically,  this  definition  is  based  on  the  following 
observation.  If  Fj(r/)  >  0,  F/  increases  (decreases)  as  r/ 
increases  (decreases)  slightly  from  the  equilibrium  value. 
We  note  that  F/  represents  the  expected  net  outflow 
from  Idle.  When  F/  increases  (decreases),  more  (less) 
users  are  going  to  leave  Idle,  and,  hence,  r/  decreases 
(increases).  This  regulatory  effect  keeps  the  system  at  the 
equilibrium  point.  Therefore,  the  equilibrium  point  is  sta¬ 
ble.  On  the  contrary,  if  Fj(r/)  <  0,  an  increase  (decrease) 
in  r/  will  cause  a  further  increase  (decrease)  in  r/  itself. 
Hence,  upon  a  slight  statistical  fluctuation,  the  system 
will  leave  the  equilibrium  point  in  this  case,  i.e.,  this 
equilibrium  point  is  unstable. 

From  Fig.  4,  we  see  that  for  the  cases  of  Pb  =  0.2 
and  0.5,  the  single  equilibrium  points  are  stable.  On  the 
other  hand,  for  pb  =  0.9,  the  middle  equilibrium  point  is 


(b) 
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Slot  index 

Fig.  8.  Time  trajectories  of  r/:  (a)  stable  system;  (b)  bistable  system. 
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ability  (cf.  Fig.  6).  Since  the  bistable  system  oscillates 
between  the  two  stable  equilibrium  points,  the  actual 
system  throughput  (packet  rejection  probability)  should 
be  a  weighted  average  of  the  throughputs  (packet  rejec¬ 
tion  probabilities)  at  the  two  stable  equilibrium  points. 
The  simulation  result  shown  in  Fig.  8b  seems  to  indi¬ 
cate  that  the  system  spends  much  more  time  about  the 
“bad”  equilibrium  point  than  the  “good”  one,  i.e.,  the 
actual  system  throughput  (packet  rejection  probability) 
is  close  to  that  at  the  “bad”  equilibrium  point.  In  fact, 
we  observe  from  extensive  simulations  that  for  M  >  4 
and  some  large  pb,  once  the  bistable  system  enters  the 
vicinity  of  the  “bad”  equilibrium  point,  it  is  extremely 
unlikely  that  it  will  leave  it.  Intuitively,  this  phenomenon 
can  be  explained  by  the  fact  that  for  large  M  and  pb^  once 
the  transmitter  buffers  are  full,  they  act  as  continuous 
packet  sources.  As  a  result,  the  air  interface  becomes 
very  congested.  Since  the  DS-SSMA  technique  is  inter¬ 
ference  limited,  it  is  unlikely  that  most  of  the  transmis¬ 
sions  succeed.  Hence,  the  transmitter  buffers  do  not  clear 
up  and  the  system  stays  at  the  “bad”  equilibrium  point. 
This  explains  why  the  simulation  results  for  large  M  in 
Fig-  6  are  close  to  the  “bad”  equilibrium  points  when  the 
system  is  bistable.  In  summary,  we  should  avoid  operat¬ 
ing  in  the  region  where  the  system  is  bistable. 


4.  SYSTEM  DESIGN 

In  this  section,  we  consider  some  issues  affecting 
the  design  of  a  DS-SSMA  system  with  the  type-II  hybrid 
ARQ  protocol.  We  assume  that  the  total  number  of  users 
in  the  system  is  fixed  to  U,  In  practice,  U  may  represent 
the  maximum  number  of  users  allowed  in  the  system.  We 
also  fix  the  error  correcting  capability  of  C\  to  t\  based 
on  complexity  considerations.  The  system  design  crite¬ 
rion  we  consider  is  that  the  packet  rejection  probability 
should  always  be  smaller  than  a  predetermined  threshold 
P%j  under  dl  allowable  input  loads  p/.  In  practice,  this 
threshold  can  be  set  to  the  maximum  acceptable  packet 
rejection  probability. 


4,1,  Transmitter  Buffer-Length  Selection 

First,  we  develop  a  procedure  to  select  the  transmit¬ 
ter  buffer  length  M  to  satisfy  the  packet  rejection  crite¬ 
rion,  namely  Prej  ^  P*rej-  From  the  discussion  in  Sec¬ 
tion  3.5,  we  know  that  it  is  undesirable  to  operate  when 
the  system  is  bistable.  Hence,  we  only  consider  the  sta¬ 


ble  region  of  operation,  i.e.,  those  values  of  p/,  ai 
M  for  which  the  system  is  stable.  From  the  numeric 
results  in  Section  3.4,  we  observe  that  the  packet  reje 
tion  criterion  is  not  satisfied  for  all  values  of  p,,  p/,,  ai 
M.  This  leads  us  to  make  the  following  definition  of  tl 
maximum  admissible  input  load  p*(M)  for  each  M: 

Definition  2:  Let  “  {pb  ^  (0, 1]:  r/(p/,pf„M)  I 
the  only  solution  of  F/(r/)  =  0  in  [0,  U]  and  F]{ri)  >  ( 
be  the  set  of  pb  such  that  the  system  is  stable  for  fixed  ^ 
and  M .  For  each  buffer  length  Af,  define  the  maximu 
admissible  input  load  p*(M)  as 

p*{M)  =  mf{/7,  e  [0,  \]:Prei{pi,Pb,M) 

>  P*rej,  for  all  Pb  e  }  (3 

If  the  set  in  Eq.  (31)  is  empty,  we  set  p%i(M)  =  1. 

The  above  definition  means  that  for  a  fixed  M,  if  pi 
p%j(M)  we  can  always  choose  a pb  such  that  Prej  ^  P^n 
Figure  9  shows  the  plot  of  p*(Af )  against  M  for  di 
ferent  values  of  Pjey*  We  observe  that  p*(M)  levels  o 
when  M  is  large.  This  implies  that  the  length  of  the  tranj 
mitter  buffer  (if  it  is  large  enough)  does  not  affect  max 
mum  admissible  input  load.  Intuitively,  since  the  tram 
mitter  buffer  is  only  used  to  prevent  packet  rejectioi 
once  its  length  is  large  enough  that  the  packet  rejectio 
criterion  is  satisfied,  further  increase  in  its  length  has  n 
effect  on  the  system  capacity,  which  is  basically  detei 
mined  by  the  coding  capability  t\.  More  precisely,  w 
can  define  the  capacity  of  the  system  as  follows: 
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Dednition  3:  Given  U  and  Ji,  the  capacity  of  the  system 
is  defined  as 

5*  liminf{sup  p*(Af)£/}  (32) 

p%j  lO  M>1 

For  the  sample  coding  structure  considered,  5*  =  13.2. 
This  is  the  maximum  average  number  of  packets  per  slot 
that  the  system  can  handle  given  U  =  60  and  t\  =54. 

Based  on  the  development  above,  we  can  construct 
the  following  procedure  to  select  the  transmitter  buffer 
length: 

1.  Given  f/  and  ti,  specify  the  maximum  input  load 
Li  (packets  per  slot).  If  L/  >  S*,  increase  t\  to 
make  Li  <S*. 

2.  Choose  a  desirable  P*ej  and  find  the  minimum 
M  such  that  p*(M)  >  Li/U. 

3.  Set  transmitter  buffer  length  to  this  M. 

For  example,  for  the  packet  rejection  threshold  P*re}  = 
10“^  and  a  load  Li  =  12  packet  per  slot,  M  =  6  is  enough. 

4.2.  Desirable  Region  of  Operation 

After  choosing  the  length  of  the  transmitter  buffer, 
we  can  determine  the  region  of  operation  within  which 
the  packet  rejection  criterion  is  satisfied.  In  other  words, 
for  each  p,-  <  L]/U,  we  determine  the  range  of  pb  such 
that  Pre]  ^  P*rej-  As  discussed  in  Section  3.5,  we  should 
avoid  operating  in  the  bistable  region.  Hence,  we  limit 
the  desirable  region  of  operation  to  be  a  subset  of  the 
stable  region.  Then  it  is  easy  to  see  that  the  desirable 
region  of  operation  can  be  defined  as  follows: 

Definition  4:  Given  t/,  fi,  and  M,  the  desirable  region 
of  operation  is  defined  as 

={iPhPb)'Pl  € 

and  PrejiPhPb,  Af )  ^ 

We  note  that  in  Eq.  (33),  L,/U  <  p*(M)  because  of  the 
buffer  length  selection  procedure  described  in  the  previ¬ 
ous  section.  Hence,  'Du  is  nonempty. 

Figure  10  shows  the  desirable  region  of  operation 
when  Li  =  12  packets  per  slot.  To  satisfy  the  packet 
rejection  criterion,  we  have  to  operate  inside  this  region. 
As  shown  in  the  previous  section,  we  know  that  M  = 
6  is  enough  for  U  =  60  and  P*ej  -  10“^.  The  shape  of 
the  region  follows  the  intuition  that  we  can  have  more 
choices  of  pb  when  the  load  p,  is  small. 


After  constructing  the  desirable  region  of  operation, 
we  can  have  two  strategies  of  choosing  p^  depending  on 
whether  we  know  (can  estimate)  the  instantaneous  value 
of  Pi.  If  we  do  know  the  instantaneous  value  of  p„  we 
can  choose  the  value  of  p*  that  maximizes  the  throughput 
or  minimizes  the  delay  for  this  p,  by  using  curves  similar 
to  those  shown  in  Figs.  6  and  7.  Otherwise,  we  need  to 
choose  a  fixed  value  of  pb  so  that  the  packet  rejection 
criterion  is  satisfied  for  all  p,-  ^  Li/U .  For  example,  we 
can  choose  pb  =  0.65  by  consulting  Fig.  10. 

5.  CONCLUSIONS 

In  this  paper,  we  have  employed  the  EPA  technique 
to  study  the  slotted  DS-SSMA  packet  radio  network 
employing  the  type-II  hybrid  ARQ  protocol  with  finite- 
length  transmitter  buffers.  We  have  analyzed  the  system 
stability  under  various  operating  conditions  and  com¬ 
puted  the  system  throughput,  delay,  and  packet  rejec¬ 
tion  probability.  We  have  also  pertbnued  computer  sim¬ 
ulations  to  verify  the  results  given  by  EPA.  We  have 
observed  that  the  system  exliiMss  bistable  behavior  in 
some  situations,  which  we  shoEld  avoid  in  designing  the 
system.  Moreover,  we  have  given  a  procedure  to  deter¬ 
mine  the  required  length  of  the  tansmlfier  buffers  and 
the  desirable  region  of  operation  based  on  a  predeter¬ 
mined  performance  requirement  for  packet  rejection. 

APPENDIX 

In  this  Appendix,  we  prove  Propositions  I  and  2. 
To  this  end,  we  need  the  following  theorem  concerning 
an  irreducible  Markov  chain  from  [12]: 
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Theorem  1:  Let  P  be  the  state  transition  matrix  of  an 
irreducible  aperiodic  Markov  chain  with  finitely  many 
states.  Then  there  is  a  unique  solution,  tt,  satisfying 

7r[P,q]-[7r,l]  (34) 

This  solution  tt  represents  the  stationary  distribution  of 
the  Markov  chain  and  each  element  of  x  lies  in  (0,  1). 

First,  for  each  fixed  rj  in  [0,  L^],  we  see  that  the 
matrix  A  represents  the  state  transition  matrix  of  an  irre¬ 
ducible  aperiodic  Markov  chain  with  states  {Idle,  S\, 
£i  1, . . . ,  Emjx  }•  Hence,  by  Theorem  1,  for  each  r/  in 
[0,  U],  the  corresponding  Markov  chain  has  the  station¬ 
ary  distribution  x(r/).  Comparing  Eqs.  (18)  and  (34),  we 
observe  that  f/T(r/)  is  a  solution  to  Eq.  (18)  if  xi(r/)  - 
ri/U,  where  xi  is  the  first  element  of  the  vector  x.  Now, 
we  consider  the  function /(r/)  =  7ri(r/)-  r//f/.  It  is  easy 
to  see  that  f{rj)  is  a  continuous  function  in  [0,  U]  as 
each  element  of  A  is  a  continuous  function  in  [0,  t/]. 
Moreover,  by  Theorem  1,  since  xi(r/)  lies  in  (Q,i)  for 
all  r/  €  [0,  U],  we  have/(0)  >  0  and  f{U)  <  0.  Hence, 
/(r/)  =  0  has  an  odd  number  of  solutions  in  (0,  U),  and 
so  does  Eq.  (18).  This  proves  Proposition  1. 

Next,  for  each  fixed  r/  in  [0,  U],  we  modify  the 
Markov  chain  above  by  deleting  the  state  Idle  and  pass¬ 
ing  the  original  transitions  going  into  Idle  to 
is  easy  to  see  that  the  resulting  Markov  chain  {M(t\  +  1) 
states)  is  still  irreducible  and  aperiodic.  Let  the  state  tran¬ 
sition  matrix  of  the  resulting  Markov  chain  be  A.  Again, 
by  Theorem  1,  this  modified  Markov  chain  has  a  unique 
positive  stationary  distribution,  x,  i.e.,  there  is  a  unique 
X  such  that 

x[A--I,q]  =  [  0^^^  1]  (35) 

M{t\  + 1)  terms 

Hence,  the  rank  of  the  matrix  [A  - 1,  q]  must  be  M(t\  + 
1).  Moreover,  as  for  the  original  Markov  chain,  the  last 
column  of  the  matrix  A  -  I  can  be  written  as  a  linear 
combination  of  the  other  columns.  So,  the  rank  of  A  - 1 
must  be  M{t\  +  1)  -  1.  But  we  observe  that  the  M{t\  + 
\yhy-M{t\  +  1)  matrix  C  can  be  obtained  by  removing 
the  {M{t\  +  l))-th  column  from  [A  - 1,  q].  Therefore,  the 
rank  of  C  is  M{t\  +  1),  i.e.,  it  is  invertible.  This  proves 
Proposition  2. 
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Abstract — In  this  paper,  we  consider  signal  reception  in  mul- 
ticarrier  code-division  multiple-access  (MC-CDMA)  systems.  A 
blind  adaptive  algorithm  is  proposed  to  determine  a  weight  vector 
which  optimally  combines  the  desired  signal  contributions  from 
different  carriers  while  suppressing  noise  and  interference.  No 
knowledge  of  the  channel  conditions  (fading  coefficients,  signa¬ 
ture  sequences  and  timing  of  interferers,  statistics  of  other  noises, 
etc.)  nor  any  training  sequence  is  required.  The  performance  is 
examined  for  Rayleigh  fading  channels.  Results  show  that  the 
proposed  algorithm  performs  well  and  is  robust  to  the  near-far 
problem.  Hence,  the  results  show  that  MC-CDMA  systems  are 
attractive  candidates  for  future  CDMA  systems. 

Index  Terms — ^Adaptive  signal  detection,  code-division  multiple 
access,  fading  channels,  interference  suppression,  multicarrier 
modulation. 

I.  Introduction 

Recently,  different  multicarrier  code-division  multiple- 
access  (MC-CDMA)  systems  [1]  have  been  proposed  and 
investigated.  With  the  possibility  of  efficient  fast  Fourier  trans¬ 
form  (FFT)  implementations  [2],  MC-CDMA  systems  can  be 
of  practical  interest.  In  one  of  these  systems,  direct-sequence 
(DS)  spreading  is  employed  together  with  multicarrier  mod¬ 
ulation  [3]  to  derive  the  benefits  of  both  techniques.  The 
maximal  ratio  combiner  (MRC)  is  adopted  to  counteract  fading 
and  narrow-band  interference.  However,  the  use  of  the  MRC 
requires  the  estimation  of  each  fading  coefficient  associated 
with  each  carrier.  When  interference,  especially  multiple- 
access  interference  (MAI),  and  noise  other  than  additive  white 
Gaussian  noise  (AWGN)  are  also  present,  not  only  does  it 
become  more  difficult  to  estimate  the  fading  coefficients,  but 
the  statistics  of  the  interference  and  the  noise  also  have  to  be 
estimated  for  optimal  reception.  Explicit  estimation  of  these 
quantities  can  ^  inefficient  and  might  not  be  able  to  adapt 
quickly  to  changes  in  channel  conditions. 
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In  [4]  and  [5],  a  blind  adaptive  receiver  has  been  proposed 
for  DS  systems.  Here,  we  propose  a  blind  adaptive  algorithm 
for  the  MC-CDMA  systems  defined  in  [3]  for  fading  channels. 
The  algorithm  determines  a  weight  vector  which  optimally 
combines  the  contributions  from  different  carriers.  The  desired 
signal  contributions  are  constructively  combined,  while  noise 
and  interference  are  suppressed.  No  knowledge  of  the  fading 
coefficients,  the  signature  sequences  and  the  timing  of  the 
interferers,  the  statistics  of  other  noise,  etc.,  is  assumed,  and 
no  training  sequence  is  required.  The  proposed  algorithm 
yields  a  weight  vector  that  maximizes  the  signal-to-noise 
ratio  (SNR)  and  in  general  performs  better  than  the  MRC. 
Under  some  channel  conditions  when  the  MRC  is  optimal, 
the  proposed  algorithm  reduces  to  maximal  ratio  combining. 
In  these  cases,  the  noise  and  interference  across  different 
carriers  are  uncorrelated.  However,  this  property  does  not 
hold  generally.  Channels  with  the  near-far  problem  provide 
common  counterexamples  in  which  interference  contributions 
in  different  channels  are  correlated  and  the  MRC  fails  to 
achieve  MAI  cancellation.  Different  methods  [6],  [7]  with 
different  levels  of  complexity  have  been  proposed  to  tackle  the 
near-far  problem  in  DS  systems.  With  multicarrier  modulation, 
the  proposed  algorithm  can  exploit  the  correlation  between 
the  received  signals  across  different  carriers  to  cancel  MAI 
and,  hence,  alleviate  the  near-far  problem.  This  property 
distinguishes  MC-CDMA  systems  from  DS  systems.  The 
simplicity  of  the  proposed  algorithm  used  with  MC-CDMA 
results  in  an  attractive  system.  The  algorithm  is  presented  in 
Section  III,  and  its  performance  is  discussed  in  Section  IV. 

Antenna  arrays  are  shown  [4],  [5],  [8]  to  be  effective  in 
boosting  received  signal  energy  and  suppressing  interference  in 
DS  spread-spectrum  communication  systems.  In  some  cases,  it 
may  be  desirable  to  use  antenna  arrays  in  MC-CDMA  systems. 
It  turns  out  that  the  proposed  algorithm  for  MC-CDMA  signal 
reception  can  be  readily  generalized  to  incorporate  the  use  of 
antenna  arrays.  The  directions  of  arrival  of  different  carriers  of 
the  signals  are  not  required  to  be  known  a  priori.  The  proposed 
scheme  implicitly  determines  the  directions  of  arrival,  the 
fading  coefficients,  and  the  noise  statistics  to  yield  the  optimal 
weight  vector.  This  generalization  is  contained  in  Section  V. 

In  Section  VI,  we  present  numerical  examples  to  verify  vari¬ 
ous  results  in  the  paper.  Conclusions  are  drawn  in  Section  VII. 

II.  System  Model 

In  this  section,  we  describe  the  model  of  the  MC-CDMA 
system.  We  assume  that  there  are  K  simultaneous  users,  which 
use  the  same  M  carriers  in  the  system. 
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The  A;th  user,  for  I  <k  <  K,  is  provided  with  a  random 
signature  sequence  given  by 


,(k)  _  (  W  (fc)  (fc) 

t  —  )  «o  >  *^1  ’  ’  "vv-t> 


(1) 


where  the  elements  are  modeled  as  independent  and  iden- 
tically  distributed  (i.i.d.)  random  variables  such  that  Pr(a-  — 
-1)  =  Pr(a|*'^  =  1)  =  1/2.  The  same  signature  sequence 
is  used  to  modulate  each  of  the  M  carriers  of  the  kth  user. 

The  kth  transmitter,  for  1  <  A:  <  A",  generates  a  stream  of 
data  symbols  given  by 

b(>^)  =  (2) 

The  data  symbols  bf^  are  random  variables  with  £;[|6^P]  = 
1.  Each  data  symbol  is  multiplied  by  N  chips  of  the  signature 
sequence. 

The  kth  transmitted  signal  can  be  expressed  as  the  real  part 
of  the  complex  signal  given  by 

M  _  f  oo  1 

E  E  (3) 

m=l  U=-cxD  ) 

The  parameter  Pk  is  the  power  for  each  carrier  of  the  fcth 
transmitted  signal,  Wm  is  the  frequency  of  the  mth  carrier, 
and  the  parameters  Cm^  can  be  used  as  a  code  across  the  M 
carriers  of  the  A:th  transmitter,  or  can  be  chosen  to  condition 
the  peak-to-average  power  ratio  of  the  kth  transmitted  signal 
[9].  For  simplicity,  we  assume  that  =  L  We  assume 

that  the  chip  waveform  ^(f)  is  bandlimited  and  the  carrier 
frequencies  are  well  separated  so  that  adjacent  frequency  bands 
do  not  interfere  with  each  other.  We  also  assume  that  is 
normalized  so  that  dt  —  Tc.  The  parameter  Tc  is 

the  delay  between  consecutive  chips.  Since  each  data  symbol 
is  multiplied  by  N  chips  on  a  carrier,  the  symbol  interval  is 
NTc. 

Without  loss  of  generality,  we  consider  the  signal  from  the 
first  transmitter  as  the  desired  signal  and  the  signals  from  all 
other  transmitters  as  interfering  signals  throughout  the  paper. 

We  now  describe  the  channel  model.  We  assume  that  the 
channel  is  a  frequency  selective  fading  channel.  By  suitably 
choosing  M  and  the  bandwidth  of  'ipit)  [3],  we  can  assume 
that  each  carrier  undergoes  independent  frequency  nonselec- 
tive  slow  Rayleigh  fading.  We  also  assume  the  presence  of 
other  noise  sources  that  are  independent  of  the  user  signals, 
namely,  AWGN  with  power  spectral  density  No  and  wide- 
sense  stationary  narrow-band  interference. 

The  received  signal  in  complex  analytic  form  is  given  by 

K  M 
r[t)  =  E  E 

A:=l  m— 1 

■(  E 

— oo  ^ 

„.+«(<)  (4) 

where  Uk,  m  accounts  for  the  overall  effects  of  phase  shift  and 
fading  for  the  mth  carrier  of  the  klh  user,  T/t  represents  the 


Fig,  1.  Receiver  for  the  first  user  in  the  MC-CDMA  system. 


delay  of  the  fcth  user  signal,'  and  n(f)  represents  other  noise. 
We  model  ak,m,  for  k  =  and  m  =  1,  M, 

as  i.i.d.  zero-mean  complex  Gaussian  random  variables  so 
that  the  amplitude  of  each  carrier  is  Rayleigh  distributed.  We 
assume  that  synchronization  has  been  achieved  with  the  first 
transmitted  signal.  Therefore,  the  delay  of  the  first  transmitted 
signal  Ti  can  be  taken  to  be  zero. 

We  consider  the  receiver  shown  in  Fig.  1.  It  consists  of 
M  branches.  Each  branch  consists  of  a  correlator  and  an 
appropriate  weight,  and  is  responsible  for  the  demodulation 
of  one  carrier.  The  correlator  on  the  mth  branch  consists 
of  a  chip-matched  filter  and  a  combiner  that  combines  the 
contributions  from  different  chips  according  to  the  signature 
sequence  of  the  first  transmitter.  We  assume  that  the  chip 
waveform  and  the  chip-matched  filter  are  chosen  to  satisfy  the 
Nyquist  criterion  so  that  there  is  no  inter-chip  interference. 
The  weight  vector  w  =  [wi,  W2,  whose  value 

is  determined  in  the  following  section,  is  an  Af-dimensional 
vector  that  optimally  combines  the  contributions  from  the  M 
branches  to  give  the  decision  statistic. 

III.  Weight  Vector  Determination 

We  consider  symbol-by-symbol  detection.  Without  loss  of 
generality,  we  consider  the  detection  of  the  symbol  bg  \  The 
output  of  the  correlator  on  the  mth  branch,  due  to  the  first 
transmitted  signal,  is  given  by 

(5) 

We  define  an  M-dimensional  vector  s  =  [si  S2  •  sa/]^- 
The  output  of  the  correlator  on  the  mth  branch,  due  to  the  fcth 
transmitted  signal,  for  A:  >  1,  is  given  by 

N-1  OO 

•E  E  (6) 

f=0  A=--oo 

where  the  function  VK  )  the  output  of  the  chip  waveform 
through  the  chip-matched  filter,  i.e.,  - 

t)(Ls.  We  also  define  the  M-dimensional  vectors  = 
[ik  1  U,  2  •  *  A/]^*  We  denote  the  output  of  the  correlator 

'  Although  an  asynchronous  system  is  being  considered,  the  method  works 
equally  well  with  synchronous  systems  and  serves  to  extend  the  number  of 
users  to  beyond  N  in  synchronous  systems. 
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on  the  mth  branch  due  to  other  noise  by  rim.  and  similarly 
define  an  M-dimensional  vector  n  =  [ni  n2  •  •  *  ^ 

overall  output  of  the  correlators,  in  vector  form,  is  given  by 

K 

z  =  s  +  n  4-  ^  ifc-  (^) 

By  the  assumption  that  other  noise  contributions  are  inde¬ 
pendent  of  the  user  signals,  n  is  uncorrelated  with  s  and 
ifc  for  all  k.  Moreover,  it  is  easy  to  check  that  the  vectors 
ifc  are  uncorrelated  for  different  k  and  are  uncorrelated  with 
s.  Therefore,  the  noise  and  interference  correlation  matrix  is 
given  by 

Rni  =  Eo,  nn"  +  ^  ifcif  (8) 

fe=2 

and  the  overall  output  correlation  matrix  is  given  by 

Ra  =  jBa[zZ^]  =  Ea[ss^]  +  Rni  (9) 

where  Ea[  ]  denotes  the  conditional  expectation  given  a/b.m. 
for  fc  =  1,  •  •  • ,  and  m  =  1,  •  •  • ,  M,  and  the  superscript  H 
denotes  the  Hermitian  (conjugate  transpose)  operation. 

The  decision  statistic  for  the  symbol  6^^^  is  given  by  ^  = 
w^z.  We  assume  that  the  channel  coefficients  oik,m  1'k 
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Fig.  2.  The  »;(th  branch  of  the  receiver  for  weight  vector  determination. 


It  can  be  shown  [11]  that  the  optimal  weight  vector  that 
maximizes  the  last  expression  in  (12),  and  hence  the  SNR, 
is  given  by  the  generalized  eigenvector  associated  with  the 
largest  eigenvalue  of  the  matrix  pencil  (R*,  Rni)-  The  ma¬ 
trix  Rz  can  be  easily  estimated  from  the  outputs  z  of  the 
correlators.  It  remains  to  estimate  Rni- 

Due  to  the  special  structure  of  the  spread-spectrum  sig¬ 
nals,  it  is  possible  to  remove  the  desired  signal  component 
while  maintaining  the  same  statistics  of  the  noises  and  in¬ 
terference.  To  estimate  Rni,  we  pass  the  outputs  of  the 
chip-matched  filter  on  each  branch  through  another  com¬ 
biner,  ^  shown  in  Fig.  2.  On  the  mth  branch,  the  sequence 
(oo^\  is  chosen  to  be  orthogonal  to  the 

sequence  (4^\  a^,  •  •  • ,  For  example,  N  is  chosen 


vary  slowly  so  that  they  effectively  remain  constant  within  the 
time  interval  used  to  determine  an  appropriate  weight  vector. 

One  way  to  determine  a  weight  vector  is  the  approach  of 
maximal  ratio  combining  [3],  [10].  Each  component  of  the 
weight  vector  is  determined  separately  by 

•®a[^0  ^  ^"*1  _  V'2F*l.Ar!Z^Cm^O!l,m 

where  the  superscript  *  denotes  complex  conjugation,  var«[‘] 
denotes  the  conditional  variance  given  ak,m  for  all  k  and 
m,  Zm  is  the  mth  component  of  z,  and  [Rni]mm  is  the 
mth  diagonal  component  of  the  correlation  matrix  Rni-  We 
note  that  the  constant  gain  \/2PiNTc  can  be  omitted  without 
affecting  the  performance  of  the  MRC.  The  MRC  is  optimal  in 
the  sense  of  maximizing  the  SNR  when  noise  and  interference 
across  different  carriers  are  uncorrelated,  for  example,  when 
only  AWGN  is  present..  In  order  to  obtain  the  MRC,  the 
channel  coefficients  ak,m  have  to  be  estimated.  The  estimation 
of  these  coefficients  ^comes  more  difficult  when  MAI  is 
also  present.  Actually,  since  MAI  across  different  earners  are 
correlated,  the  MRC  is  not  optimal. 

In  this  paper,  we  determine  the  optimal  weight  vector  that 
maximizes  the  SNR  defined  by 

fiallw^Sp] 


SNR  = 


Equivalently,  we  find  the  weight  vector  that  maximizes 

_ .^q[|w^z[^] _ _  w^RgW 

r  /  MH  w"R„iw‘ 

I  w"  I  n -I- ^  ife  1 1 

\  k=2 


to  be  even,  and 


N 

forO  <i  <  — 

N 

for  —  <  i  <  N  —  1. 
2  -  - 


(13) 


Then,  at  the  output  of  the  second  combiner  on  each  branch, 
the  first  user  signal  component  is  nullified.  On  the  other  hand, 
it  is  straightforward  to  check  that  the  (second-order)  statistics 
of  the  outputs  due  to  other  transmitted  signals  and  noise  are 
the  same  as  those  of  the  first  combiner.  In  particular,  if  the 
overall  output  from  these  second  combiners  is  denoted  by  z, 
then  the  corresponding  correlation  matrix  is  given  by 


Ri  =  Ea[zz^]  =  Rni-  (14) 


Therefore,  Rni  can  be  estimated  from  z  and,  hence,  the 
optimal  weight  vector  can  be  determined. 

In  [5],  it  is  shown  that  the  generalized  eigenvector  associated 
with  the  largest  eigenvalue  of  a  matrix  pencil  of  the  form  given 
by  (Rz,  Rz)  can  also  be  found  by  a  constrained  stochastic 
gradient  algorithm  similar  to  the  least  mean  squares  (LMS) 
algorithm  but  without  a  training  sequence.  The  computational 
complexity  of  this  blind  adaptive  algorithm  is  of  the  order 
M  per  iteration,  which  is  much  smaller  than  the  calculation 
of  the  generalized  eigenvectors.  In  the  notation  of  this  paper, 
the  algorithm  can  be  expressed  as  follows.  For  each  symbol 
interval,  we  obtain  the  vectors  z  and  z  at  the  outputs  of  the 
2M  combiners.  For  the  jth  symbol  interval,  we  denote  them 
by  z(j)  and  z(j),  respectively.  We  denote  the  weight  vector 
in  the  jth  symbol  interval  by  w(j)  and  update  w(j)  for  each 
j  according  to  the  following  rule: 

For  j  >  0,  get  w(y)  by 

w(i)  =  C'(i){w(i  -  1)  -f  /x[z(i)"w(j  -  1)] 

-  [(z(j)"z(i))z(i)  -  (z(j)"z(j))z(j)]}  (15) 


SNR-l-1  = 
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where  C{j)  is  chosen  to  stabilize  the  algorithm  and  /x  is  the 
adaptation  step  size  that  controls  the  speed  of  convergence. 
Details  concerning  the  stabilization  constant  C{j),  the  adapta¬ 
tion  step  size  fi,  and  the  initialization  of  the  algorithm  can  be 
found  in  [4]  and  [5].  This  blind  adaptive  algorithm  has  been 
shown  to  be  effective  for  DS  systems. 


IV.  Performance 

When  the  channel  is  only  corrupted  by  AWGN  and  un¬ 
correlated  narrow-band  interference  without  MAI,  it  can  be 
readily  shown  that  the  noise  and  interference  components 
across  the  M  carriers  are  uncorrelated,  and,  hence,  the  MRC 
gives  the  optimal  weight  vector.  Under  these  conditions,  it  is 
straightforward  to  check  that  the  proposed  algorithm  reduces 
to  the  MRC.  However,  the  noise  and  interference  components 
are  not  always  uncorrelated,  especially  when  the  near-far 
problem  exists.  In  this  section,  we  examine  the  performance 
of  the  MC-CDMA  system  with  the  proposed  algorithm  in 
situations  with  the  near-far  problem.  It  turns  out  that  the 
proposed  algorithm  exploits  the  high  dimensional  space  and 
the  correlation  between  the  received  samples  provided  by 
multicarrier  modulation  to  cancel  MAI  and,  hence,  handles 
the  near-far  problem  in  a  very  effective  way. 

For  convenience,  we  define,  for  fc  =  1,  •  •  • ,  AT,  the  vectors 
dk  =  [dfc,i  dk^2  •••  dk,MV  by 

(16) 

for  m  =  1,  •  •  • ,  M.  Notice  that  since  ajb,  k  ^  ^  K 

and  m  =  1,  •  •  • ,  M,  are  i.i.d.  zero-mean  complex  Gaussian 
random  variables,  so  are  It  is  shown  in  [12]  that  the 

generalized  eigenvector  associated  with  the  largest  generalized 
eigenvalue  of  the  matrix  pencil  (Rz,  Rz),  i.e.,  the  optimal 
weight  vector  maximizing  the  SNR  defined  in  (11),  is  given 
by 


Rni^dl- 


(17) 


A,  Single  Interferer 

It  is  instructive  to  consider  the  case  of  jFC  =  2,  i.e.,  when 
there  is  only  a  single  strong  interferer.  Then,  the  noise  and 
interference  correlation  matrix  is  given  by 

Rni  =  NoN%l  +  /?2Ci2df  (18) 


where 

oo 

A=*-oo 

The  optimal  weight  vector  is  given  by 


(19) 


w, 


-  NoNTc 


N0NTC  +  P2W 


d2 


(20) 


and  the  resulting  SNR  is  given  by 


SNR. 


2PiNTc  ,2  _^|d£diL\ 

No  V  '  NoNTc -\-P2W)' 


(21) 


We  note  that,  if  di  and  d2  are  orthogonal,  then  the  optimal 
weight  vector  annihilates  the  contribution  of  the  interferer 
without  any  penalty  in  SNR. 

Geometrically,  the  contributions  of  the  users  are  represented 
by  the  vectors  di  and  d2  in  an  M-dimensional  space.  If  the 
vectors  are  perpendicular,  then  the  optimal  weight  vector  is 
di.  It  collects  energies  from  all  carriers  of  the  first  user  while 
rejecting  energies  from  the  second  user  by  destructive  inter¬ 
ference.  In  general,  di  and  d2  are  not  always  perpendicular. 
However,  as  shown  below,  in  a  high  dimensional  space,  it  is 
likely  that  they  are  nearly  perpendicular. 

In  a  Rayleigh  fading  channel,  dk,m,  for  k  -  1,  2,  and 
TO  =  1,  •  •  • ,  M,  are  i.i.d.  complex  Gaussian  random  variables 
with  zero  mean  and  unit  variance.  We  also  assume  that  T2 
is  uniformly  distributed  on  [0,  Tc).  We  define  an  outage 
probability  Po  as  the  probability  that  the  SNR  of  the  first  user 
suffers  more  than  3  dB  of  loss  due  to  the  interference  from 
the  second  user.  From  (21),  the  outage  probability  is  given  by 


p  /M  < 

2  -  NoNTc +  I32WJ 
I  2  -  |d2P  ) 


(22) 


The  bound  on  the  outage  probability  is  the  probability  that  the 
vectors  make  an  angle  between  0  and  45°  (or  between  135° 
and  180°).  We  note  that  the  bound  remains  valid  no  matter  how 
large  the  power  of  the  second  transmitted  signal  becomes. 

Conditioned  on  dp  we  perform  a  unitary  transformation 
so  that  di  is  transformed  into  d'^  =  [|dil)  0,  •  •  • ,  0]^,  and 
d2  is  transformed  into  some  vector  d^.  The  components  of 
dj  remain  i.i.d.  complex  Gaussian  random  variables  with 
zero  mean  and  unit  variance.  Hence,  the  conditional  outage 
probability  is  bounded  by 


where  <£2  m  component  of  d^.  The  random  variables 

X  =  i2^=2  l4,mP  and  y  =  14, il"*  are  independent 
and  are  chi-square  distributed  with  degrees  of  freedom  equal 
to  2(M  -*  1)  and  2,  respectively.  Their  probability  density 
functions  are 


fx{x) 


1 


r(M-i) 


for  X  >  0  (24) 


and 

fY{y)  =  e-y,  fovy>0  (25) 

where  r(  )  is  the  gamma  function.  By  direct  integration  and 
using  the  definition  of  the  gamma  function,  the  conditional 
outage  probability  is  found  to  be  bounded  by 

<Pr(X<y)  =  ^.  (26) 

Since  the  result  does  not  depend  on  the  vector  dp  the 
ditional  outage  probability  Po  is  also  bounded  by  1/2 
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number  of  interierers 

Fig.  3.  Average  optimal  SNR  with  multiple  interferers. 

We  can  see  from  (26)  that  the  outage  probability  drops  at  least 
exponentially  with  the  number  of  carriers.  For  example,  with 
eight  carriers  the  outage  probability  is  bounded  by  7.81  x  10”^. 

B,  Multiple  Interferers 

When  there  are  more  interferers  in  the  system,  the  SNR 
can  still  be  found.  Since  the  expression  for  the  SNR  for  the 
case  with  multiple  interferers  is  considerably  more  complex 
than  (21),  it  is  omitted  here.  Instead,  we  plot  the  “average” 
optimal  SNR  as  a  function  of  the  number  of  interferers 
in  Fig.  3  to  demonstrate  the  performance  of  the  proposed 
algorithm.  The  average  is  taken  over  500  realizations.  In 
each  realization,  the  fading  coefficients  for  different  users 
are  randomly  generated,  and  the  resulting  SNR  is  analytically 
determined  under  the  random  signature  sequence  model.  The 
power  of  each  interferer  is  20  dB  above  the  desired  user 
power,  and  the  average  signal-to-thermal-noise  ratio  (STNR  = 
2PiMNTclNo)  is  15  dB.  The  SNR  performances  for  three 
different  numbers  of  carriers,  namely,  M  =  1,  8,  and  16,  are 
shown.  In  each  case,  we  adjust  N  to  give  MN  =  256  so  that 
the  bandwidth  of  the  system  is  kept  constant. 

From  Fig.  3,  we  see  that  multicarrier  systems  together  with 
the  proposed  algorithm  cancel  MAI  when  there  are  a  moderate 
number  of  strong  interferers  in  the  system.  For  a  fixed  value 
of  M,  the  MAI  canceling  capability  decreases  as  the  number 
of  interferers  increases. 

V.  Incorporation  of  Antenna  Arrays 

We  note  from  [3]  and  Section  II  that,  given  the  coherence 
bandwidth  of  a  fading  channel  and  a  limit  on  the  system 
bandwidth,  the  number  of  carriers  M  cannot  be  arbitrarily 
large.  Although  it  is  shown  in  (26)  that  the  outage  probability 
decreases  exponentially  with  M,  for  some  fading  channels, 
the  MAI  canceling  capability  of  the  multicarrier  system  is  still 
inadequate.  It  is  shown  in  [4]  and  [5]  that  MAI  cancellation 
can  be  achieved  in  a  single-carrier  DS-CDMA  system  with 
a  linear  phase  antenna  array,  where  spatial  diversity  of  the 
users  is  exploited.  The  idea  can  also  be  applied  to  multicarrier 


systems  to  enhance  their  MAI  canceling  capability.  Actually, 
the  approach  in  Section  III  can  be  readily  generalized  to 
incorporate  the  use  of  antenna  arrays.  Suppose  that  an  antenna 
array  of  D  elements  is  used  for  signal  reception.  Then,  the 
received  signal  in  (4)  becomes 

K  M  _ 

=  E  E 

k  —  l  rn=l 
oo 

=  — oo 

+  (27) 

where  is  a  D-dimensional  vector  that  accounts  for  the 
direction  of  arrival  of  the  mth  carrier  of  the  kih  user.  The 
received  vector  r{t)  becomes  D-dimensional,  as  does  the  noise 
contribution  n{t). 

The  output  of  each  correlator  becomes  D-dimensional. 
Therefore,  on  the  mth  branch,  the  desired  signal  component 
Sm,  the  interference  from  the  fcth  transmitter  and  other 
noise  contributions  iim  are  all  D-dimensional  vectors.  We  can 
concatenate  the  D-dimensional  vectors  in  the  M  different 
branches  to  obtain  an  MD-dimensional  desired  signal  vector 
s.  Similarly,  we  can  obtain  the  MD-dimensional  noise  vector 
n  and  the  MD-dimensional  interference  vectors  ijt.  Therefore, 
we  obtain  (7)  with  all  the  vectors  becoming  MD-dimensional. 
Moreover,  s,  n,  and  i/t,  for  all  k,  remain  uncorrelated.  The 
optimal  vector  which  is 

now  MD-dimensional,  can  be  obtained  in  exactly  the  same 
way  as  before. 

As  in  Section  V,  it  is  illustrative  to  examine  the  outage 
probability  of  the  multicanier  system  with  an  antenna  array 
under  the  assumption  of  a  single  interferer.  For  simplicity,  we 
assume  that  the  angles  of  arrival  and  the  array  response  vectors 
of  the  desired  transmitted  signal  (or  the  interfering  signal) 
are  the  same  for  all  carriers.  We  assume  a  linear  array  of  D 
isotropic  elements,  which  are  separated  by  a  half  wavelength, 
is  being  used.  We  assume  that  the  bandwidth  of  the  system  is 
much  smaller  than  the  carrier  frequency,  and,  thus,  by  using  the 
same  linear  array,  the  array  response  vectors  for  all  carriers 
are  approximately  given  by 

dk,m  =.dfc  =  [l,  (28) 

for  fc  =  1  and  2,  and  m  =  1,  •  •  • ,  M.  These  are  the 
array  responses  due  to  the  desired  transmitted  signal  and  the 
interfering  signal,  respectively.  In  (28),  9x  and  02,  which  are 
modeled  by  two  uniform  random  variables  on  [-tt,  tt),  are  the 
incident  angles  of  the  two  signals. 

Similar  to  (21),  the  optimal  SNR  in  this  case  is 

where  Vi  =  di  (S>  di,  D2  =  d2  0  d2,  and  (8)  denotes  the 
Kronecker  product. 

We  again  consider  the  outage  probability  Po  of  the  multi¬ 
carrier  system  with  the  linear  array  as  the  probability  that  the 


NoNTc^P2\T>2\ 
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interfering  signal  causes  SNR*  to  drop  by  3  dB.  Similar  to 
the  steps  in  Section  IV,  from  (29),  we  have 


\V2? 


(30) 


Conditioned  on  di  and  <j)  ~  sin  0i  —  sin  we  let  U  be  the 
unitary  transformation  considered  in  Section  IV~A.  Then,  the 
unitary  transformation  defined  by  =  U  Id  x  d  transforms 
Vi  to  [\di\dj,  0,  •••,  0]^  and  P2  to  ci'2  0  d2.  Using  this 


transformation,  we  have 

(M 

m=2 


where 


0  <  gD{<t>)  = 


sin(i97r0/2) 

[Dsin(7r</>/2) 


<  1. 


(32) 


Again,  by  the  fact  that  the  random  variables  X)m=2  \^2,m\^ 
and  14,  iP  are  independent  chi-square  distributed,  it  is  easy 
to  show  that 


Af~l 


if  0  <  9d{(I>)  <  1/2 

(33) 

,  if  1/2  <  goW  <  1* 


1- 


1 


{4>:  1/2<pd(<^)£1} 


M-l 


/^(0)#  (34) 


where  is  the  density  function  of  <l>,  given  by 

'  (35) 


M<f>)  —  \  ■jr2(2  +  |<^|) 

u, 


otherwise. 


In  the  above,  K{x)  =  Jq  ^  du/  y/l  —  sin^  u,  for  |x|  <  1, 
is  the  complete  elliptic  integral  of  the  first  kind  [13]. 

For  M  =  8  and  £)  =  5,  we  obtain  Po  <  7.83  x  10“'*, 
which  is  about  10  times  smaller  than  the  bound  on  Po 
obtained  in  Section  V-A.  Since  both  of  the  bounds  in  (26) 
and  (34)  are  tight  when  the  interferer  power  is  strong,  we  see 
that  the  incorporation  of  an  antenna  array  in  the  multicarrier 
system  does  help  in  enhancing  its  MAI  canceling  capability. 
In  addition  to  enhancing  the  MAI  canceling  capability,  the 
multi-element  array  also  provides  an  additional  antenna  gain 
of  D,  which  can  be  obtained  easily  by  the  proposed  combining 
algorithm. 


VI.  Numerical  Examples 


In  this  section,  we  investigate  the  performance  of  the  pro¬ 
posed  multicarrier  scheme  under  different  channel  conditions 
by  Monte  Carlo  simulations.  Throughout  this  section,  the 
STNR  is  fixed  at  15  dB.  Moderate  values  of  A  =  32  and 
Af  =  8  are  used.  The  chip  waveform  employed  is  the  raised- 
cosine  waveform  with  a  roll-off  factor  of  0.1  [10],  i.e., 


sin(7rf/Tc)  cos(7ri/10rc) 

^  irt/Tc  1  -  tV25T2  ■ 


(36) 


All  curves  shown  are  the  results  of  averaging  over  500 
realizations.  We  use  dotted  lines,  broken  lines,  solid  lines,  and 
dash-dotted  lines  to  show  the  SNR’s  obtained  by  the  MRC, 
the  eigen  analysis  method,  the  blind  adaptive  algorithm,  and 
the  maximum  achievable  SNR  in  (11),  respectively. 

First,  we  consider  the  simple  case  of  a  fading  channel  with 
AWGN  only,  i.e.,  no  multiple-access  interference  or  narrow- 
band  interference.  The  results  are  shown  in  Fig.  4.  In  this 
case,  the  optimal  weight  vector  is  the  MRC  determined  by  the 
fading  coefficients.  As  expected,  the  obtained  SNR  is  about  15 
dB.  Without  any  knowledge  of  the  fading  coefficients,  both 
the  eigen  analysis  method  and  the  blind  adaptive  algorithm 
can  give  SNR’s  very  close  to  the  15-dB  limit.  In  this  simple 
case,  we  can  choose  the  adaptation  rate  of  the  blind  adaptive 
algorithm  to  be  faster  than  the  eigen  method  by  adjusting  fi 
in  (16).  It  can  be  seen  that  the  weight  vector  adapts  quickly 
and  the  SNR  climbs  to  within  0.5  dB  of  the  optimal  value  in 
less  than  20  symbol  intervals. 

Next,  we  consider  the  case  where,  in  addition  to  AWGN,  the 
fading  channel  is  also  contaminated  by  narrow-band  interfer¬ 
ence  occupying  one  of  the  frequency  bands  of  the  multicarrier 
signals.  The  power  of  the  narrow-band  interference  is  taken 
to  be  20  dB  stronger  than  the  total  power  from  all  carriers 
of  the  desired  signal.  Thus,  the  power  of  the  narrow-band 
interference  is  about  29  dB  above  the  power  of  the  desired 
signal  in  that  frequency  band.  The  results  of  the  simulation 
are  shown  in  Fig.  5.  In  this  case,  the  optimal  weight  vector 
is,  again,  the  MRC  determined  by  the  fading  coefficients, 
the  location,  and  the  power  of  the  narrow-band  interference. 
The  maximum  achievable  SNR  is  slightly  below  15  dB, 
which  shows  that  narrow-band  interference  can  be  effectively 
suppressed  with  suitable  choices  of  weight  vectors.  When  the 
required  knowledge  of  the  channel  and  the  interference  is  not 
available,  we  need  to  employ  either  the  eigen  method  or  the 
blind  adaptive  algorithm  to  obtain  the  optimal  weight  vector. 
In  this  case,  we  choose  the  adaptation  rate  of  the  blind  adaptive 
algorithm  to  be  similar  to  that  of  the  eigen  method.  It  can  be 
seen  that  the  SNR’s  climb  to  about  2  dB  from  the  optimal  value 
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Fig.  5.  Performance  under  fading  channels  with  narrow-band  interference. 


Fig.  6.  Performance  under  fading  channels  with  a  strong  interferer. 


in  about  20  symbol  intervals.  While  the  eigen  method  can 
attain  an  SNR  level  very  close  to  the  optimal  value,  the  SNR 
obtained  by  the  blind  adaptive  algorithm  is  2  dB  below  the 
optimal  value.  The  loss  in  performance  of  the  blind  adaptive 
algorithm  is  due  to  its  stochastic  nature  and  can  be  traded  off 
with  its  convergence  speed. 

It  is  well  known  that  DS  spreading  allows  multiple  users  to 
share  the  same  communication  channel  when  the  powers  of 
all  users  are  comparable.  However,  the  performance  degrades 
under  imperfect  power  control.  We  consider  the  worst  case 
scenario  where  the  near-far  problem  exists.  Fig.  6  shows  the 
case  where  there  is  an  interferer  with  power  40  dB  stronger 
than  that  of  the  desired  user.  We  see  that  the  optimal  SNR  is 
slightly  less  than  15  dB.  This  implies  that  the  interfering  signal 
can  be  effectively  cancelled.  However,  the  SNR  obtained 
by  the  MRC  is  about  2  dB,  which  is  far  from  optimal. 
For  comparison,  we  choose  the  adaptation  rate  of  the  blind 
adaptive  algorithm  to  be  similar  to  that  of  the  eigen  method. 
Again,  the  eigen  method  can  attain  an  SNR  level  very  close  to 
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Fig.  7.  Performance  using  an  antenna  array  under  fading  channels  with  a 
strong  interferer. 


the  optimal  value,  but  the  SNR  obtained  by  the  blind  adaptive 
algorithm  is  3  dB  below  the  optimal  value.  Nevertheless,  by 
using  either  one  of  the  methods,  we  can  get  an  SNR  of  at  least 
1 1  dB  in  about  20  symbol  durations. 

Finally,  we  consider  the  case  of  a  single  interferer,  where  a 
linear  antenna  array  of  three  elements  is  used.  The  simulation 
results  are  shown  in  Fig.  7.  In  this  case,  the  adaptation  rates  of 
both  of  the  eigen  method  and  the  blind  adaptive  algorithm  are 
slower  because  of  the  large  numbers  of  weights.  Otherwise,  the 
results  are  similar  to  the  previous  case,  except  for  an  additional 
5-dB  gain  due  to  the  antenna  array. 

VII.  Conclusions 

We  have  considered  a  signal  reception  scheme  in  MC- 
CDMA  systems.  We  have  shown  for  communication  over 
fading  channels  that,  by  suitably  combining  the  received 
signals  on  all  the  carriers,  the  proposed  receiver  can  com¬ 
bine  the  desired  signal  contributions  from  different  carriers 
constructively,  while  canceling  noise  and  MCI.  Moreover,  we 
have  proposed  a  blind  adaptive  algorithm  to  determine  an 
optimal  weight  vector,  which  is  used  to  combine  the  received 
signals  from  different  carriers  optimally.  The  approach  can  be 
generalized  to  incorporate  the  use  of  antenna  arrays  in  MC- 
CDMA  systems.  Simulations  show  that  the  proposed  algorithm 
performs  very  well  under  different  channel  conditions  includ¬ 
ing  situations  with  the  near-far  problem.  Thus,  MC-CDMA 
systems  with  the  proposed  algorithm  are  attractive  candidates 
for  future  CDMA  systems. 
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